(57) Abstract: This invention relates to methods and reagents for selecting a desired protein or nucleic acid molecule by linking
mRNA, with known or unknown sequences, to its translated protein to form a cognate pair. The cognate pair is selected based upon
desired properties of the protein or the nucleic acid. This method also includes the evolution of a desired protein or nucleic acid
molecule by amplifying the nucleic acid portion of the selected cognate pair, introducing variation into the nucleic acid, translating
the nucleic acid, attaching the nucleic acid to its protein to form a second cognate pair, and re-selecting this cognate pair based upon
desired properties.

SELECTION AND EVOLUTION OF NUCLEIC ACIDS AND POLYPEPTIDES

Field of the Invention

The present invention relates to compositions and methods for the selection of 
nucleic acids and polypeptides. 

Background of the Invention 

Ligand-receptor interactions are of interest for many reasons, from elucidating
basic biological site recognition mechanisms to drug screening and rational drug design. It
has been possible for many years to drive in vitro evolution of nucleic acids by selecting
molecules out of large populations that preferentially bind to a selected target, then
amplifying and mutating them for subsequent re-selection (Tuerk and Gold, Science
249:505 (1990), herein incorporated by reference).

The ability to perform such a selection process with proteins would be extremely 
useful. This would permit in vitro design and production of proteins that bind specifically 
to chosen ligands. The use of proteins, as compared to nucleic acids, is particularly 
advantageous because the twenty diverse amino acid side chains in proteins have far more
binding possibilities than the four similar side chains in nucleic acids. Further, many
biologically and medically relevant ligands bind proteins. 

Both nucleic acid and protein evolution methods require access to a large and highly 
varied population of test molecules, a way to select members of the population that exhibit 
the desired properties, and the ability to reproduce the selected molecules with mutated
variations to obtain another large population for subsequent selection.

Prior attempts to develop a protein evolution method were primarily limited by the 
inability of the proteins to reproduce themselves and the inability to link mRNA encoding a 
polypeptide with the translated product. Additionally, the generation of large peptide 
libraries and screening methods have, until recently, required that the process have an in
vivo expression step. Examples include yeast two- or three-hybrid, yeast display and phage
display methods (Fields and Song, Nature 340:245 (1989); Licitra and Liu, PNAS
93:12817 (1996); Boder and Wittrup, Nat Biotechnol 15:553 (1997); and Scott and Smith,
Science 249:386 (1990)). In vivo methods suffer from various disadvantages, including a
limited library size and cumbersome screening steps. Additionally, undesirable selective 
pressures can be placed on the generation of variants by cellular constraints of the host. 

In vitro methods have been developed more recently, using prokaryotic and
eukaryotic in vitro translation systems, such as ribosome display (Mattheakis et al., PNAS
91:9022 (1994); Hanes and Plückthun, PNAS 94:4937 (1997); Jermutus et al., Current
Opinion in Biotechnology 9:534 (1998), all herein incorporated by reference). These
methods link the protein and its encoding mRNA with the ribosome, and the entire complex
is screened against a ligand of choice. Potential disadvantages of this method include the
large size of the ribosome, which could interfere with the screening of the attached, and
relatively tiny, protein.

In 1997, two groups of workers developed an in vitro method of attaching a protein
to its coding sequence during translation by using the ribosomal peptidyl transferase with
puromycin attached to a linker DNA (Szostak et al., International Patent Publication WO
98/31700; Roberts and Szostak, PNAS 94:12297 (1997); Nemoto et al., FEBS Letters
414:405 (1997), all herein incorporated by reference). Once the coding sequence and
peptides are linked, the peptides are exposed to a selected ligand. Selection or binding of
the peptide by the ligand also selects the attached coding sequence, which can then be
reproduced by standard means. Both Roberts and Szostak and Nemoto et al. used the
technique of attaching a puromycin molecule to the 3' end of a coding sequence by a DNA
linker or other non-translatable chain. Puromycin is a tRNA acceptor stem analog which 
accepts the nascent peptide chain under the action of the ribosomal peptidyl transferase and 
binds it stably and irreversibly, thereby halting translation. These methods suffer from 
certain disadvantages. For example, the coding sequence encoding each peptide must be
known and be modified both initially and between each selection. Thus, the methods of
Roberts and Nemoto cannot be used to select native unknown mRNAs. Further, the
modification of the coding sequence adds several steps to the process. Finally, the attached
puromycin on the linker molecules may compete in the translation reaction with the native
tRNAs for the A site on the ribosome reading its coding sequence or a nearby ribosome,
and could thus "poison" the translation process, just as would unattached puromycin in the
translation reaction solution. Inadvertent interactions between puromycin and ribosomes
could result in two kinds of reaction non-specificity: prematurely shortened proteins and
proteins attached to the wrong message. There are reports in the prior art that indicate that
the avidity of the A site and the peptidyl transferase for the puromycin may be modulated
by Mg++ concentration (Roberts, Curr. Opin. Chem. Biol. 3:268 (1999), herein incorporated
by reference). Although Mg++ concentration may be titrated to control for the first kind of
non-specificity, i.e. premature termination of translation, it will not affect the second type,
i.e. inaccurate mRNA-protein linkage.

Thus, a need exists for an in vitro nucleic acid-based protein evolution system that 
does not necessarily require initial knowledge of the nucleic acid's sequence or repeated 
chemical modification of the nucleic acids, and which can accurately link a mRNA to its 
protein. 

Summary of the Invention

The present invention provides compositions and methods to select and evolve 
desired properties of proteins and nucleic acids. In various embodiments, the current
invention provides modified tRNA's and tRNA analogs. Other embodiments include 
methods for generating polypeptides, assays enabling selection of individual members of 
the population of polypeptides having desired characteristics, methods for amplifying the 
nucleic acids encoding such selected polypeptides, and methods for generating new variants 
to screen for enhanced properties. 

In several embodiments, the present invention permits the attachment of a protein to 
its respective mRNA without requiring modification of native mRNA, although modified 
mRNA may still be used. The specificity of the methods embodied in various aspects of
the current invention is determined by the specificity of the codon-anticodon interaction.

In a preferred embodiment, the invention permits the selection of nucleic acids by 
selecting the proteins for which they code. This may be accomplished by connecting the 
protein to its cognate mRNA at the end of translation, which in turn is done by connecting 
both the protein and mRNA to a tRNA or tRNA analog. 

A preferred embodiment of the invention includes a tRNA molecule capable of
covalently linking a nucleic acid encoding a polypeptide and the polypeptide to the tRNA,
wherein the linkage of the nucleic acid occurs on a portion of the tRNA other than the
linkage to the polypeptide and wherein the tRNA comprises a linking molecule associated
with the anticodon of the tRNA. This anticodon of the tRNA is capable of forming a
crosslink to the mRNA under irradiation with light of a required wavelength, preferably a
furan-sided psoralen monoadduct on the anticodon irradiated with UVA, preferably in the
range of about 300-450 nm, more preferably in the range of about 320 to 400 nm, and most
preferably about 365 nm. Preferably, an amino acid or amino acid analog is attached to the
3' end of a tRNA molecule by a stable bond to generate a stable aminoacyl tRNA analog
(SATA).

Other embodiments include a mRNA comprising a psoralen, preferably located in 
the 3' region of the reading frame, more preferably at the most 3' codon of the reading 
frame, most preferably at the 3' stop codon of the reading frame. In preferred
embodiments, linkage between the tRNA and the mRNA is a cross-linked psoralen
molecule, more preferably a furan-sided psoralen monoadduct. 

A further embodiment of the invention provides a method of forming a 
monoadduct. According to this method a target oligonucleotide with at least one uridine 
and at least one modified uridine is contacted with psoralen, and the target oligonucleotide
and psoralen are coupled to form a monoadduct. The modified uridine according to this
embodiment may be modified to avoid coupling with psoralen, and preferably the modified
uridine is pseudouridine. According to this embodiment the target oligonucleotide may be
a tRNA molecule, such as tRNA, modified tRNA and tRNA analogs or a mRNA molecule,
such as mRNA, modified mRNA and mRNA analogs. In a further embodiment the
psoralen is coupled to the target oligonucleotide by one or more cross-links. According to
this embodiment a second oligonucleotide with a nucleotide sequence complementary to
the target oligonucleotide sequence may be present. This second oligonucleotide may
contain no uridine or may contain uridine residues that are modified to avoid cross-linking
with the target oligonucleotide. Preferably, the modified uridine is pseudouridine.

Several embodiments of the present invention include a method of stably linking a
nucleic acid, a tRNA, and a polypeptide encoded by the nucleic acid together to form a
linked nucleotide-polypeptide complex. In a preferred embodiment, the nucleic acid is an
mRNA and the linked nucleotide-polypeptide complex is a mRNA-polypeptide complex.
The method can further comprise providing a plurality of distinct nucleic acid-polypeptide
complexes, providing a ligand with a desired binding characteristic, contacting the
complexes with the ligand, removing unbound complexes, and recovering complexes
bound to the ligand.

Several methods of the current invention involve the evolution of nucleic acid
molecules and/or proteins. In one embodiment, this invention comprises amplifying the
nucleic acid component of the recovered complexes and introducing variation to the
sequence of the nucleic acids. In other embodiments, the method further comprises
translating polypeptides from the amplified and varied nucleic acids, linking them together
using tRNA, and contacting them with the ligand to select another new population of bound
complexes. Several embodiments of the present invention use selected protein-mRNA
complexes in a process of in vitro evolution, in particular the iterative process in which the
selected mRNA is reproduced with variation, translated and again connected to cognate
protein for selection.
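The iterative select, amplify, and vary cycle summarized above can be illustrated with a small in-silico toy model. The following is only a sketch under stated assumptions and is not part of the disclosed method: the scoring function (Hamming distance to a hypothetical target sequence) merely stands in for ligand binding by the translated protein, and the function names, population sizes, and mutation rate are all illustrative choices. Selected sequences are carried forward unchanged alongside their mutated copies, on the assumption that amplification leaves many copies unaltered.

```python
import random

BASES = "ACGU"


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def mutate(seq: str, rate: float, rng: random.Random) -> str:
    """Copy a sequence, redrawing each base at random with probability `rate`."""
    return "".join(rng.choice(BASES) if rng.random() < rate else b for b in seq)


def evolve(target: str, pop_size: int = 200, keep: int = 20,
           rate: float = 0.02, rounds: int = 30, seed: int = 0) -> list:
    """Toy selection/evolution loop; returns the best score found in each round.

    The score (distance to `target`) is a hypothetical stand-in for the
    binding assay applied to the cognate pairs.
    """
    rng = random.Random(seed)
    population = ["".join(rng.choice(BASES) for _ in target)
                  for _ in range(pop_size)]
    best_per_round = []
    for _ in range(rounds):
        # Selection: keep the members that score best in the stand-in assay.
        selected = sorted(population, key=lambda s: hamming(s, target))[:keep]
        best_per_round.append(hamming(selected[0], target))
        # Amplification with variation: unchanged copies plus mutated copies.
        population = selected + [mutate(rng.choice(selected), rate, rng)
                                 for _ in range(pop_size - keep)]
    return best_per_round


if __name__ == "__main__":
    print(evolve("AUGGCUUACGGAUAA"))
```

Because the selected sequences are retained unchanged in each round, the best score in this toy model can never get worse from one round to the next.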

Brief Description of the Drawings 
Figure 1 illustrates schematically one example of the complex formed by the
mRNA and its protein product when linked by a modified tRNA or analog. As shown, a
codon of the mRNA pairs with the anticodon of a modified tRNA and is covalently
crosslinked to a psoralen monoadduct by UV irradiation. The translated polypeptide is
linked to the modified tRNA via the ribosomal peptidyl transferase. Both linkages occur
while the mRNA and nascent protein are held in place by the ribosome.

Figure 2 illustrates schematically an example of the in vitro selection and evolution
process, wherein the starting nucleic acids and their protein products are linked (e.g.,
according to Figure 1) and are selected by a particular characteristic exhibited by the
protein. Proteins not exhibiting the particular characteristic are discarded and those having
the characteristic are amplified with variation, preferably via amplification with variation of
the mRNA, to form a new population. In various embodiments, nonbinding proteins will
be selected. The new population is translated and linked via a modified tRNA or analog,
and the selection process is repeated. As many selection and amplification/mutation rounds
as desired can be performed to optimize the protein product.

Figure 3 illustrates one method of construction of a tRNA molecule of the
invention. In this embodiment, the 5' end of a tRNA, a nucleic acid encoding an anticodon
loop and having a molecule capable of stably linking to mRNA (such as psoralen, as used
in this example), and the 3' end of tRNA modified with a terminal puromycin molecule are
ligated to form a complete modified tRNA for use in the in vitro evolution methods of the
invention.

Figure 4 describes two alternative embodiments by which the crosslinking molecule
psoralen can be positioned such that it is capable of linking the mRNA with the tRNA in
the methods of the invention. A first embodiment includes linking the psoralen
monoadduct to the mRNA, and a second embodiment includes linking the psoralen to the
anticodon of the tRNA. Psoralen can either be monoadducted to the anticodon or the 3'
terminal codon of the reading frame for known or partially known messages. This can be
done in a separate procedure from translation, i.e. before translation occurs.

Figure 5 illustrates the chemical structures for uridine and pseudouridine.
Pseudouridine is a naturally occurring base found in tRNA that forms hydrogen bonds just
as uridine does, but lacks the 5-6 double bond that is the target for psoralen.

Detailed Description of the Preferred Embodiment

Various aspects of the present invention use a tRNA mechanism that links
messenger RNA (mRNA) to its translated protein product, forming a "cognate pair." In
several embodiments, mRNA whose sequence is not known can be expressed, its protein
characterized through a selection process against a ligand with desired or selected
properties, and nucleic acid evolution resulting in protein evolution can be performed in
vitro to arrive at molecules with enhanced properties. The cognate pairs are preferably
attached via a linking tRNA, modified tRNA, or tRNA analog. In a preferred embodiment,
the tRNA is connected to the nascent peptide by the ribosomal peptidyl transferase and to
the mRNA through an ultraviolet induced cross link between the anticodon of the tRNA or
tRNA analog and the codon of the RNA message. This can be done by, for example,
thiouracil, but in a preferred method, the linker is a psoralen crosslink made from a
psoralen monoadduct pre-placed on either the mRNA or preferably on the tRNA anticodon
of choice. Preferably, a tRNA stop anticodon is selected. A stop codon/anticodon pair
selects for full length transcripts. One skilled in the art will understand that a mRNA not
having a stop codon may also be used and that any codon or nucleic acid triplet may be
used. A tRNA having an anticodon which is not naturally occurring can be synthesized
according to methods known in the art (e.g. Figure 3).

The terms "protein," "peptide," and "polypeptide" are defined herein to mean a
polymeric molecule of two or more units comprised of amino acids in any form (e.g., D- or 
L- amino acids, synthetic or modified amino acids capable of polymerizing via peptide 
bonds, etc.), and these terms may be used interchangeably herein.

The term "pseudo stop codon" is defined herein to mean a codon which, while not
naturally a nonsense codon, prevents a message from being further translated. A pseudo
stop codon may be created by using a "stable aminoacyl tRNA analog" or SATA, as
described below. In this manner, a pseudo stop codon is a codon which is recognized by
and binds to a SATA. Another method by which to create a pseudo stop codon is to create
an artificial system in which the necessary tRNA having an anticodon complementary to
the pseudo stop codon is substantially depleted. Accordingly, translation will stop when the
absent tRNA is required, i.e. at the pseudo stop codon. One skilled in the art will
appreciate that there are numerous ways to create a pseudo stop codon as defined herein.

The formation of connections between mRNA and its protein product generally
requires a tRNA or tRNA analog with certain characteristics. In several embodiments of
the current invention, the tRNA or tRNA analog will have a stable peptide acceptor. This
modification changes the tRNA or tRNA analog such that after it accepts the nascent
peptide chain by the action of the ribosomal peptidyl transferase, it holds the chain in a
stable manner such that the peptidyl transferase cannot detach it. This may be
accomplished by using a bond such as a 2' ester on a 3' deoxy adenosine or an "aminoacyl
tRNAox-red" which can bind to the ribosome, accept the peptide chain, and then not act as a
donor in the next transpeptidation (Chinali et al., Biochem. 13:3001 (1974); Krayevsky
and Kukhanova, Prog. Nuc. Acid Res 23:1 (1979) and Sprinzl and Cramer Prog. Nuc. Acid
Res 22:1 (1979), all herein incorporated by reference).

In a preferred embodiment, an amino acid or amino acid analog is attached to the 3'
end of the tRNA or tRNA analog by a stable bond. This stable bond contrasts with the labile,
high energy ester bond that connects these two in the native structure. The stable bond not
only protects the bond from the action of the peptidyl transferase, but also preserves the
structure during subsequent steps. For convenience, this modified tRNA or tRNA analog
will be referred to as a "stable aminoacyl tRNA analog" or SATA. As used herein, a SATA
is an entity which can recognize a selected codon such that it can accept a peptide chain by
the action of the ribosomal peptidyl transferase when the cognate codon is in the reading
position of the ribosome. The peptide chain will be bound in such a way that the peptide is
bound stably and cannot be detached by the peptidyl transferase. Preferably, the selected
codon is recognized by hydrogen bonding.

One method for creating a SATA was published in 1973 (Fraser and Rich, PNAS
70:2671 (1973), herein incorporated by reference). This method involves the conversion of
a tRNA, or tRNA analog, to a 3'-amino-3'-deoxy tRNA. This is accomplished by adding a
3'-amino-3'-deoxy adenosine to the end of a native tRNA with tRNA nucleotidyl
transferase after removing the native adenosine from it with snake venom
phosphodiesterase. This modified tRNA is then charged with an amino acid by the
respective aminoacyl tRNA synthetase (aaRS). Fraser and Rich used an aaRS in which the
tRNA is charged on the 3', rather than the 2', hydroxyl. The amino acid is bound to the
tRNA by a stable amide bond rather than the usual labile high-energy ester bond. Thus,
when it accepts a peptide from ribosomal peptidyl transferase it will stably hold the peptide
and not be able to donate it to another acceptor.

In a preferred method, the SATA will be attached to the translated message by a
psoralen cross link between the codon and anticodon. Psoralen cross links are
preferentially made between sequences that contain complementary 5' pyrimidine-purine 3'
sequences, especially UA or TA sequences (Cimino et al., Ann. Rev. Biochem. 54:1151
(1985), herein incorporated by reference). The codon coding for the SATA, or the linking
codon, can be PYR-PUR-X or X-PYR-PUR, so that several codons may be used for the
linking codon. Conveniently, the stop or nonsense codons have this configuration. Using a
codon that codes for an amino acid may require minor adjustments to the genetic code,
which could complicate some applications. Therefore, in a preferred embodiment, a stop
codon is used as the linking codon and the SATA functions as a nonsense suppressor in that
it recognizes the linking codon. One skilled in the art, however, will appreciate that, with
appropriate adjustments to the system, any codon can be used.
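The PYR-PUR-X / X-PYR-PUR rule stated above can be checked by simple enumeration. The sketch below is illustrative only; the function and variable names are assumptions introduced here, not terms from the specification. It lists the RNA codons that contain a 5' pyrimidine-purine 3' step, separately flags those containing the especially favored UA step, and confirms that the three stop codons fall within the first set.

```python
from itertools import product

PYRIMIDINES = set("UC")
PURINES = set("AG")
STOP_CODONS = {"UAA", "UAG", "UGA"}


def is_linking_candidate(codon: str) -> bool:
    """True if the codon has the form PYR-PUR-X or X-PYR-PUR."""
    first, second, third = codon
    pyr_pur_x = first in PYRIMIDINES and second in PURINES
    x_pyr_pur = second in PYRIMIDINES and third in PURINES
    return pyr_pur_x or x_pyr_pur


def has_ua_step(codon: str) -> bool:
    """True if the codon contains the especially favored 5'-UA-3' step."""
    return "UA" in codon


ALL_CODONS = ["".join(p) for p in product("UCAG", repeat=3)]
CANDIDATES = [c for c in ALL_CODONS if is_linking_candidate(c)]
UA_CANDIDATES = [c for c in CANDIDATES if has_ua_step(c)]

if __name__ == "__main__":
    print(len(CANDIDATES), "codons fit PYR-PUR-X or X-PYR-PUR")
    print("stop codons that fit:", sorted(STOP_CODONS & set(CANDIDATES)))
    print("codons with a UA step:", UA_CANDIDATES)
```

Under this reading of the rule, half of the 64 codons qualify, which is consistent with the statement that several codons may serve as the linking codon.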

Fraser and Rich did their work in E. coli, but the most effective in vitro translation
systems are in eukaryotes. The use of prokaryotic suppressors in eukaryotic translation
systems appears to be feasible (Geller and Rich, Nature 283:41 (1980); Edwards et al., PNAS
88:1153 (1991); Hou and Schimmel, Biochem 28:6800 (1989), all herein incorporated by
reference). They are primarily limited by the resident aaRS's. This limitation is overcome
by various embodiments of the present invention because the tRNA or analog can be
charged in the prokaryotic system and then purified according to established methods
(Lucas-Lenard and Haenni, PNAS 63:93 (1969), herein incorporated by reference).

In several embodiments of the current invention, acceptor stem modifications
suitable for use in the tRNAs and analogs can be produced by various methods known in
the art. Such methods are found in, for example, Sprinzl and Cramer, Prog. Nuc Acid Res.
22:1 (1979), herein incorporated by reference. In an alternative embodiment, a
"transcriptional tRNA" is used, i.e. the sequence of the tRNA as it would be transcribed,
rather than as it exists after the post-transcriptional processing that leads to the atypical
and modified bases that are common in tRNAs. These transcriptional tRNAs are capable of
functioning as tRNAs (Dabrowski et al., EMBO J. 14: 4872, 1995; and Harrington et al.,
Biochem. 32: 7617, 1993, both herein incorporated by reference). Transcriptional tRNA can
be produced by transcription or can be made by connecting commercial RNA sequences
together, piecewise as in Figure 3, or in some combination of established methods. For
instance, the 5' phosphate and 3' puromycin are commercially available attached to
oligoribonucleotides. (Commercial RNA sequences are available from Dharmacon Research
Inc., Lafayette, CO. This company can also provide modified native tRNA, such as sequences
in which thymine is substituted for uracil and pseudouridine.) These pieces can be connected
together using T4 DNA ligase, as is well-known in the art (Moore and Sharp, Science 256: 992,
1992, herein incorporated by reference). Alternatively, in a preferred embodiment, T4 RNA
ligase is used (Romaniuk and Uhlenbeck, Methods in Enzymology 100:52 (1983), herein
incorporated by reference).

In several embodiments of the present invention, psoralen is monoadducted to the 
SATA by construction of a tRNA from pieces including a psoralen linked oligonucleotide 
(Fig. 3) or by monoadduction to a native or modified tRNA or analog (Fig. 4). In a
preferred embodiment, psoralen is first monoadducted to an oligonucleotide containing part
of the anticodon loop as described below and this product is then ligated to the remaining 
fragments of the SATA. 

In several embodiments, translation will stop when the nascent protein is attached to
the SATA by the peptidyl transferase. When a large number of ribosomes are in this
position the SATA and the mRNA will be connected with UV light. In a preferred method
this will be accomplished by having a psoralen crosslink formed. Psoralens have a furan
side and a pyrone side, and they readily intercalate between complementary base pairs in
double stranded DNA, RNA, and DNA-RNA hybrids (Cimino et al., Ann. Rev. Biochem.
54:1151 (1985), herein incorporated by reference). Upon irradiation with UV, preferably in
the range of 320 nm to 400 nm, cross linking will take place and leave the staggered
pyrimidines covalently bound. By either forming crosslinks and photo reversing them or
by using selected wavelengths, it is possible to form monoadducts, described more fully
below. These will be either pyrone sided or furan sided monoadducts. Upon further
irradiation, the furan sided monoadducts can be covalently crosslinked to complementary
base pairs. The pyrone sided monoadducts cannot be further crosslinked. The formation of
the furan sided psoralen monoadduct (MAf) is also done according to established methods.
In a preferred method, the psoralen is attached to the anticodon of the SATA. However,
psoralen can also be attached at the end of the reading frame of the message, as depicted in
Figure 4.

Methods for large scale production of purified MAf on oligonucleotides are
described in the literature (e.g., Spielmann et al., PNAS 89:4514, 1992, herein incorporated
by reference), as are methods that require fewer resources, but have some non-cross-linkable
pyrone sided psoralen monoadduct contamination (e.g., U.S. Patent No. 4,599,303; Gamper
et al., J. Mol. Biol. 197: 349 (1987); Gamper et al., Photochem. Photobiol. 40:29 (1984),
all herein incorporated by reference). In several embodiments of the current invention,
psoralen labeling is accomplished by using either method. In a preferred embodiment,
furan sided monoadducts will be created using visible light, preferably in the range of
approximately 400 nm - 420 nm, according to the methods described in U.S. Patent No.
5,462,733 and Gasparro et al., Photochem. Photobiol. 57:1007 (1993), both herein
incorporated by reference. In one aspect of this invention, a SATA with a furan sided
monoadduct or monoadducted oligonucleotides for placement on the 3' end of mRNAs,
along with a nonadducted SATA, are provided as the basis of a kit.

In one embodiment, the formation and reversal of monoadducts and crosslinks are 
performed according to the methods of Bachellerie et al. (Nuc Acids Res 9:2207 (1981)), 
herein incorporated by reference. In a preferred embodiment, efficient production of 
monoadducts, resulting in high yield of the end-product, is accomplished using the methods 

of Kobertz and Essigmann, J. Am. Chem. Soc. 1997, 119, 5960-5961 and Kobertz and
Essigmann, J. Org. Chem. 1997, 62, 2630-2632, both herein incorporated by reference.

In a preferred embodiment, a SATA fragment and complementary RNA or DNA is
used in which all of the uridines, except the target, are replaced by pseudouridine. Figure 5
compares the chemical structures for uridine and pseudouridine. Pseudouridine is a
naturally occurring base found in tRNA that forms hydrogen bonds just as uridine does.
This embodiment is particularly advantageous because the pseudouridine forms the same
Watson-Crick hydrogen-bonds as the native uridine but lacks the 5-6 double bond that is
the target for interacting with either the furan or pyrone side of the psoralen molecule. This
permits the same base-pairing characteristics as an oligonucleotide with uridine, but
provides only one target for the psoralen. Because the pyrone side linkage is usually
formed after the furan side has reacted, this removal of a staggered target allows the
monoadduct to be formed with high efficiency irradiation without forming crosslinks and
with minimal formation of pyrone sided monoadduct (MAp). Irradiation is preferably in the
range of about 300-450 nm, more preferably in the range of about 320 to 400 nm, and most
preferably about 365 nm. More specifically, a pseudouridine on the SATA permits: 1) the
use of SATA sequences that contain uridines which are potential targets for the psoralen
and 2) on the cRNA or cDNA, elimination of crosslink formation, leaving the process
stopped at furan sided monoadduct (MAf) formation when using UVA wavelengths, which
are much more efficient than visible light. Use of the SATA and the monoadduct in
several embodiments of the current invention is particularly advantageous for in vitro
translation systems. However, one skilled in the art will appreciate that in situ systems can
also be used. Various embodiments of the current invention will be applicable to any in
vitro translation system, including, but not limited to, rabbit reticulocyte lysate (RRL),
wheat germ, E. coli, and yeast lysate systems. Many embodiments of the current invention
are also well-suited for use in hybrid systems where components of different systems are
combined.
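The pseudouridine substitution described above amounts to a simple sequence design rule: every uridine except the intended psoralen target is replaced. A minimal sketch of that rule follows, assuming a plain-string representation of the oligonucleotide in which "Ψ" denotes pseudouridine; the example sequence, the function names, and the choice of target position are hypothetical and are not taken from the specification.

```python
PSEUDOURIDINE = "Ψ"  # symbol used here for pseudouridine (an assumption)


def mask_uridines(seq: str, target_index: int) -> str:
    """Replace every uridine except the one at `target_index` with pseudouridine."""
    if seq[target_index] != "U":
        raise ValueError("target position must be a uridine")
    return "".join(
        PSEUDOURIDINE if base == "U" and i != target_index else base
        for i, base in enumerate(seq)
    )


def psoralen_targets(seq: str) -> list:
    """Positions still carrying an unmodified uridine (potential psoralen targets)."""
    return [i for i, base in enumerate(seq) if base == "U"]


if __name__ == "__main__":
    fragment = "GUUAUC"  # hypothetical oligonucleotide fragment
    masked = mask_uridines(fragment, target_index=4)
    print(masked, psoralen_targets(masked))
```

After masking, only the chosen position remains as a potential target in this representation, mirroring the statement that the substitution provides only one target for the psoralen.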

tRNAs aminoacylated on a 3' amide bond are reported not to combine with the
elongation factor EF-Tu which assists in binding to the A site (Sprinzl and Cramer, Prog.
Nuc. Acid Res. 22:1 (1979), herein incorporated by reference). Such modified tRNAs do,
however, bind to the A site. This binding of 3' modified tRNAs can be increased by
changing the Mg++ concentration (Chinali et al., Biochem. 13:3001 (1974), herein
incorporated by reference). The appropriate concentrations and/or molar ratios of SATA
and Mg++ can be determined empirically. If the concentration or A site avidity of SATA is
too high, the SATA could compete with native tRNAs for non-cognate codons, i.e., could
function much like puromycin and stall translation. If the concentration or A site avidity of
SATA is too low, the SATA might not effectively compete with the release factors, i.e., it
would not act as an effective nonsense suppressor tRNA. The balance between these can
be determined empirically.

It is also believed that the elongation factor aids in proofreading the codon-anticodon
recognition. The error rate in the absence of elongation factor and the associated
GTP hydrolysis is estimated to be 1 in 100 for codons one nucleotide away (Voet and Voet,
Biochemistry 2nd ed. pp. 1000-1002 (1995), John Wiley and Sons, herein incorporated by
reference). In a preferred embodiment, UAA is used as the linking codon. For UAA as the
linking codon, there are 7 non stop codons which differ by one nucleotide. This is 7/61 or
about 11.5% of the non stop codons. One can estimate the probability of miscoding a given
codon as (0.01)(0.115) = 1.15 x 10^-3 miscodes per codon. Thus, one would expect a
miscode about every 870 codons, a frequency which will not substantially impair
performance of various methods of the current invention. In an alternative embodiment,
UAG or UGA is used as the linking codon.
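The arithmetic in the preceding paragraph can be reproduced by enumerating the single-nucleotide neighbors of the linking codon. The sketch below is illustrative; the function names are introduced here, and the 1-in-100 error rate is simply the estimate quoted in the text. With unrounded values the result is roughly one miscode per 871 codons, which agrees with the figure of about 870 obtained from the rounded 11.5%.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}
BASES = "UCAG"
N_SENSE_CODONS = 4 ** 3 - len(STOP_CODONS)  # 61 non stop codons


def single_nucleotide_neighbors(codon: str) -> list:
    """All codons that differ from `codon` at exactly one position."""
    return sorted(
        codon[:i] + base + codon[i + 1:]
        for i in range(3)
        for base in BASES
        if base != codon[i]
    )


def miscoding_estimate(linking_codon: str, error_rate: float = 0.01):
    """Return (sense neighbors, fraction of sense codons, miscodes per codon).

    `error_rate` is the quoted estimate for misreading a codon that is one
    nucleotide away in the absence of elongation factor proofreading.
    """
    sense_neighbors = [c for c in single_nucleotide_neighbors(linking_codon)
                       if c not in STOP_CODONS]
    fraction = len(sense_neighbors) / N_SENSE_CODONS
    return len(sense_neighbors), fraction, error_rate * fraction


if __name__ == "__main__":
    count, fraction, per_codon = miscoding_estimate("UAA")
    print(count, round(fraction, 4), per_codon, round(1 / per_codon))
```

The same function can be applied to UAG or UGA, each of which has one more sense neighbor than UAA because only one of its single-nucleotide neighbors is itself a stop codon.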

In several embodiments, appropriate concentrations of SATA and Mg++ are used in
the in vitro translation system, e.g. RRL, in the presence of the mRNA molecules in the
pool, causing translation to cease when the ribosome reaches the codon which permits the
SATA to accept the peptide chain (the linking codon described above). Within a short
time, most of the linking codons will be occupied by SATAs within ribosomes. In a
preferred embodiment, the system then will be irradiated with UV light, preferably at
approximately 320 nm to 400 nm. Nucleic acids are typically transparent to, i.e. do not
absorb, this wavelength range. Upon irradiation, the psoralen monoadduct will convert to a
crosslink connecting the anticodon and the codon by a stable covalent bond.

In a preferred embodiment, the target mRNA is pre-selected. In another 
embodiment, the target mRNA is artificially produced. In an alternative embodiment, the 
target consists of messages native to the system under investigation, which may be 
unknown and/or unidentified. The ability to use unknown and/or unidentified mRNAs is a
particular advantage of several embodiments of the current invention.

In several embodiments, once all the nascent proteins are connected to their cognate
mRNAs, the ribosomes are released or denatured. Preferably, this is accomplished by the
depletion of Mg++ through dialysis, simple dilution, or chelation. One skilled in the art will
understand that other methods, including, but not limited to, denaturation by changing the
ionic strength, the pH, or the solvent system can also be used.

In several embodiments of the invention, the selection of cognate pairs will be based 
upon affinity binding of proteins according to any of a variety of established methods, 
including, but not limited to, affinity columns, immunoprecipitation, and many high 
throughput screening procedures. A variety of ligands may also be used, including, but not 
limited to, proteins, nucleic acids, chemical compounds, polymers and metals. In addition, 
cell membranes or receptors, or even entire cells may be used to bind the cognate pairs. 
The selection can be positive or negative. That is, the selected cognate pairs can be those 
that do bind well to a ligand or those that do not. For instance, for a protein to accelerate a 
thermodynamically favorable reaction, i.e., act as an enzyme for that reaction, it should 
bind both the substrate and a transition state analog. However, the transition state analog 
should be bound much more tightly than the substrate. This is described by the equation

$$\frac{k_{\text{enzyme}}}{k_{\text{un}}} = \frac{K_T}{K_{\text{subs}}}$$

where the ratio of the rate of the reaction with the enzyme, $k_{\text{enzyme}}$, to the rate without,
$k_{\text{un}}$, is equal to the ratio of the binding of the transition state to the enzyme, $K_T$, over the
binding of the substrate to the enzyme, $K_{\text{subs}}$ (Voet and Voet, Biochemistry 2nd ed. p. 380
(1995), John Wiley and Sons, herein incorporated by reference).

In a preferred embodiment, proteins which compete poorly for binding to the 
substrate but compete well for binding to the transition state analog are selected. 
Operationally, this may be accomplished by taking the proteins that are easily eluted from a 
matrix with substrate or substrate analog bound to it and are the most difficult to remove 
from matrix with transition state analog bound to it. By sequentially repeating this 
selection and reproducing the proteins through replication and translation of the nucleic 
acid of the cognate pairs, an improved enzyme should evolve. Affinity to one entity and 
lack of affinity to another in the same selection process is used in several embodiments of 
the current invention. Selection can also be done by RNA in many embodiments. 
Once the selection has identified a population of cognate pairs it may be convenient
to detach the mRNA strand from the SATA to reproduce it. This is not always necessary
but can be accomplished by irradiating the pairs with UV, preferably at approximately 313
nm or just below. This has been identified as a wavelength that will photoreverse the
psoralen crosslink to MAf and damage the nucleic acid minimally. The ratio of
photoreversal to nucleic acid damage is estimated to be 1 photoreversal for damage to 1 in
600 bases (Cimino et al., Biochem 25:3013 (1986), herein incorporated by reference).

One skilled in the art will appreciate that the mRNAs can be reproduced in many
ways including, but not limited to, by RNA-dependent RNA polymerases or by reverse
transcription and PCR. This can take place using mRNAs separated from the cognate pairs,
e.g., using poly T or poly U to hybridize to the poly A tails of, for instance, native unknown
messages or by leaving the cognate pairs intact and using oligonucleotide primers that
hybridize partially into the reading frame for known messages. Alternatively, commercial
kits for rapid amplification of cDNA ends may be used. When this is used to evolve
proteins and not just to select them, it would be preferable to sample at least one amino acid
substitution at each position in the protein.
The Replication Threshold 

A nominal minimum number of replications for efficient evolution may be
estimated using the following formulae. If there is a sequence which is n nucleotides in
length, with a selective improvement r mutations away with a mutation rate of p, the
probability of generating the selective improvement on replication may be determined as
follows:

For r = 1, the probability is the probability of a mutation at the right point, p, times the
probability that it mutated to the right one of the three nucleotides that are different from
the starting point, 1/3, times the probability that the other n-1 sites remain unmutated,
$(1-p)^{n-r}$, or

$$P_1 = \left(\frac{p}{3}\right)(1-p)^{n-1}$$

where P = the probability of attaining a given change r mutations away. More generally,
for all r values:

$$P_r = \left(\frac{p}{3}\right)^{r}(1-p)^{n-r}$$
It is instructive to compare the chances of finding an advantage one mutation away
with the chances three mutations away. This is because, given the triplet genetic code, any
given codon can only change into nine other codons in one mutation. Indeed, it turns out
that no codon can actually change into nine other amino acid codes in one mutation. The
maximum number of amino acids that can be accessed in one mutation is seven amino
acids and there are only eight codons of the sixty-four that can do this. Most codons have
five or six out of nineteen other amino acids within one mutation. To reach all nineteen
amino acids that are different from the starting one requires, in general, three mutations.
These three mutations cannot be sequential since the two intervening ones will not, in
general, be selectively advantageous. Therefore we need to use steps that are, at least, three
mutations in size (r = 3) to use all 20 amino acids.
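
The single-mutation reach of each codon can be tabulated directly. The sketch below is a hedged illustration, assuming the standard genetic code and counting only sense codons; the helper name `reachable_amino_acids` is hypothetical and not part of the source.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code in UCAG order; '*' marks a stop codon (assumed).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reachable_amino_acids(codon):
    """Amino acids, other than the codon's own, one nucleotide change away."""
    own = CODE[codon]
    found = set()
    for i in range(3):
        for b in BASES:
            if b != codon[i]:
                aa = CODE[codon[:i] + b + codon[i + 1:]]
                if aa not in ("*", own):
                    found.add(aa)
    return found


# Number of new amino acids reachable from each sense codon in one mutation.
counts = {c: len(reachable_amino_acids(c)) for c, aa in CODE.items() if aa != "*"}
best = max(counts.values())
print("maximum reachable in one mutation:", best)
print("codons reaching the maximum:", sorted(c for c, k in counts.items() if k == best))
```

Running it lets a reader check the per-codon figures quoted above against whichever code table is assumed.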

For a mutation rate of 0.0067, which is that reported for "error-prone PCR", using a
message of 300 nucleotides, which gives a short protein of 100 amino acids:

$$P_3 = 1.51 \times 10^{-9}$$

Therefore, one would expect to need a threshold of:

$$\frac{1}{1.51 \times 10^{-9}} \approx 6.64 \times 10^{8}$$

replications at that mutation rate to reasonably expect to reach the next amino acid that is
advantageous. This is not the number of replications to use, however, since the binomial
expansion shows that over 1/3 of trials (actually about 1/e) would not contain the given
sequence with selective advantage.

A Poisson approximation for large n and small p can be calculated so
that we can compute the general term when n is, say, of the order 10^9 and p is of the order
10^-9. The general term of the approximation, for k occurrences in n trials, is:

$$P(k) = \frac{(np)^{k} e^{-np}}{k!}$$
An amplification factor of greater than approximately 6/P ensures that evolution
will progress with the use of all amino acids. This is useful when the production of novel
proteins precludes the use of "shuffling" of preexisting proteins.
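
The replication threshold can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the probability of a specific change r mutations away to be (p/3)^r (1-p)^(n-r), which reproduces the P_3 value quoted above, and it uses the zero-occurrence term of a Poisson distribution for the chance of missing the change. The function names are hypothetical.

```python
import math


def p_change(p, n, r):
    """Probability of one specific change r mutations away in a sequence of
    n nucleotides at per-nucleotide mutation rate p (assumed form)."""
    return (p / 3) ** r * (1 - p) ** (n - r)


def miss_probability(replications, p_target):
    """Poisson estimate of the chance that none of the replications
    carries the target change: exp(-replications * p_target)."""
    return math.exp(-replications * p_target)


p3 = p_change(0.0067, 300, 3)
print(f"P_3 = {p3:.3g}")
for factor in (1, 6):
    n_rep = factor / p3
    miss = miss_probability(n_rep, p3)
    print(f"{factor}/P = {n_rep:.3g} replications, miss probability {miss:.4f}")
```

With 1/P replications the miss probability is 1/e, whereas 6/P replications reduce it to exp(-6), which is the motivation for the amplification factor given above.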
Limits on Purification 

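Given a reversible binding where B and C compete for A:

$$AB \rightleftharpoons A + B \qquad AC \rightleftharpoons A + C$$

$$k_B = \frac{[A][B]}{[AB]} \qquad (1)$$

$$k_C = \frac{[A][C]}{[AC]} \qquad (2)$$

The total concentrations can be expressed as follows:

$$[B]_T = [B] + [AB] \qquad (3)$$

$$[C]_T = [C] + [AC] \qquad (4)$$

Dividing (3) by (4):

$$\frac{[B]_T}{[C]_T} = \frac{[B] + [AB]}{[C] + [AC]}$$

And substituting (1) and (2) for [B] and [C]:

$$[B]_T = k_B \frac{[AB]}{[A]} + [AB]$$

$$[C]_T = k_C \frac{[AC]}{[A]} + [AC]$$

Rearranging the equation gives the following results:

$$\frac{[B]_T}{[C]_T} = \frac{[AB]\,(k_B + [A])/[A]}{[AC]\,(k_C + [A])/[A]}$$

Canceling the [A]'s in the numerator and denominator:

$$\frac{[B]_T}{[C]_T} = \frac{[AB]\,(k_B + [A])}{[AC]\,(k_C + [A])}$$

Finally, rearranging the equation provides the following equation:

$$\frac{[AB]}{[AC]} = \frac{[B]_T}{[C]_T} \cdot \underbrace{\frac{(k_C + [A])}{(k_B + [A])}}_{\text{Enrichment Factor}}$$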

The above factor is termed the "Enrichment Factor". The ratio of the total
components is multiplied by this factor to calculate the ratio of the bound components, or
the enrichment of B over C. The maximum enrichment factor is $k_C/k_B$, when the [A] is
significantly smaller than $k_C$ or $k_B$.

The enrichment is limited by the ratio of binding constants. To enrich a scarce
protein that is bound 100 times as strongly as its competitors, the ratio of that protein to its
competitors is increased by 1 million with 3 enrichments. To enrich a protein that only
binds twice as strongly as its competitors, 10 enrichment cycles would gain only an
enrichment of ~1000.
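
The cycle arithmetic above can be sketched as follows. This is a minimal illustration, assuming that the constants are dissociation constants, that the enrichment factor per cycle is (k_C + [A])/(k_B + [A]), and that the same conditions apply in every cycle so that the factors multiply. The function names are hypothetical.

```python
def enrichment_factor(k_b, k_c, a):
    """Per-cycle enrichment of bound B over bound C: (k_C + [A]) / (k_B + [A])."""
    return (k_c + a) / (k_b + a)


def total_enrichment(k_b, k_c, a, cycles):
    """Cumulative enrichment after repeated cycles (assumes identical cycles)."""
    return enrichment_factor(k_b, k_c, a) ** cycles


# Limiting case [A] << k_B, k_C: the factor approaches k_C / k_B.
print(total_enrichment(1.0, 100.0, 0.0, 3))   # 100-fold tighter binder, 3 cycles
print(total_enrichment(1.0, 2.0, 0.0, 10))    # 2-fold tighter binder, 10 cycles
# A free [A] comparable to the constants lowers the per-cycle factor.
print(total_enrichment(1.0, 100.0, 10.0, 3))
```

The last call illustrates why keeping [A] well below both constants matters: with [A] ten times k_B, the per-cycle factor falls from 100 to 10.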

By an exactly analogous method an enrichment factor for selecting proteins that bind
least well can be shown. In the equation:

$$\frac{[C]}{[B]} = \frac{[C]_T}{[B]_T} \cdot \frac{k_C\,([A] + k_B)}{k_B\,([A] + k_C)}$$

the enrichment is maximal at [A] >> $k_C$ or $k_B$.

The following Examples illustrate various embodiments of the present invention 
and are not intended in any way to limit the invention. 

EXAMPLE 1: PRODUCTION OF THE SATA USING URIDINE

One skilled in the art will understand that the SATA can be produced in a number of
different ways. For example, in a preferred embodiment, three fragments (Fig. 1) were
purchased from a commercial source (i.e. Dharmacon Research Inc., Boulder, CO).
Modified bases and a fragment 3 with a pre-attached puromycin on its 3' end and a PO4 on
its 5' end were included, all of which were available commercially. Three fragments were
used to facilitate manipulation of the fragment 2 in forming the monoadduct.

Yeast tRNAAla or yeast tRNAPhe were used; however, sequences can be chosen
from widely known tRNAs or by selecting sequences that will form into a tRNA-like
structure. Preferably, sequences with only a limited number of U's in the portion that
corresponds to the fragment 2 are used. Using a sequence with only a few U's is not
necessary because psoralen preferentially binds 5'UA3' sequences (Thompson J.F., et al.,
Biochemistry 21:1363, herein incorporated by reference). However, there would be less
doubly adducted product to purify out if such a sequence was used.

Fragment 2 was preferably used in a helical conformation to induce the psoralen to 
intercalate. Accordingly, a complementary strand was required. RNA or DNA was used, 
and a sequence, such as poly C to one or both ends, was added to facilitate separation and 
removal after monoadduct formation was accomplished. 

Fragment 2 and the cRNA were combined in buffered 50 mM NaCl solution. The 
Tm was measured by hyperchromicity changes. The two molecules were re-annealed and 
incubated for 1 hour with the selected psoralen at a temperature ~10°C less than the Tm. 
The psoralen was selected based upon the sequence used. A relatively insoluble psoralen,
such as 8-MOP, could be selected which has a higher sequence stringency but may need to
be replenished. A more soluble psoralen, such as AMT, has less stringency but will fill
most sites. Preferably, HMT is used. If a fragment 2 is chosen that contains more non-target
U's, a greater stringency is desired. Decreasing the temperature or increasing ionic
strength by adding Mg++ was also used to increase the stringency. In a preferred
embodiment, Mg++ was omitted and ~400 mM NaCl solution was used.

Following incubation, psoralen was irradiated at a wavelength greater than
approximately 400 nm. The irradiation depends on the wavelength chosen and the psoralen
used. For instance, approximately 419 nm at 20-150 J/cm2 was preferably used for HMT.
This process results in an almost entirely furan sided monoadduct.

PURIFICATION OF A MONOADDUCT

The monoadduct was then purified by HPLC as described in Sastry et al., J.
Photochem. Photobiol. B Biol. 14:65-79, herein incorporated by reference. The fact that
fragment 2 was separate from fragment 3 facilitated the purification step because,
generally, purification of monoadducts of 25-mers is difficult (Spielmann et al. PNAS 89:
4514-4518, herein incorporated by reference).

LIGATION OF FRAGMENT 2 AND 3

The fragment 2 was ligated to the fragment 3 using T4 RNA ligase. The puromycin
on the 3' end acted as a protecting group. This was done as per Romaniuk and Uhlenbeck,
Methods in Enzymology 100:52-59 (1983), herein incorporated by reference. Joining of
fragment 2+3 to the 3' end of fragment 1 was done according to the methods described in
Uhlenbeck, Biochemistry 24:2705-2712 (1985), herein incorporated by reference.
Fragment 2+3 was 5' phosphorylated by polynucleotide kinase and the two half molecules
were annealed.

In an alternative method, significant quantities of furan sided monoadducted U were
formed by hybridizing poly UA to itself and irradiating as above. The poly UA was then
enzymatically digested to yield furan sided U which was protected and incorporated into a
tRNA analog by nucleoside phosphoramidite methods. Other methods of forming the
psoralen monoadducts include the methods described in Gamper et al., J. Mol. Biol. 197:
349 (1987); Gamper et al., Photochem. Photobiol. 40:29 (1984); Sastry et al., J. Photochem.
Photobiol. B Biol. 14:65-79; Spielmann et al. PNAS 89:4514-4518; and U.S. Patent Number
4,599,303, all herein incorporated by reference.
SATAs generated by the methods described above read UAG (anticodon CUA).
Additionally, UAA or UGA were also used. In various embodiments, any message that had
the stop codon that was selected as the "linking codon" was used.

PRODUCTION OF PSORALENATED FURAN SIDED MONOADDUCTS FROM
CUAGAΨCUGGAGG RNA FRAGMENTS

UV Light Exposure of RNA:DNA Hybrids

Equal volumes of 3 ng/ml RNA:RNA hybrid segments and of 10 µg/ml HMT, both
in 50 mM NaCl, were transferred into a new 1.5 ml capped polypropylene
microcentrifuge tube and incubated at 37°C for 30 minutes in the dark. This was then
transferred onto a new clean culture dish. This was positioned in a photochemical reactor
(419 nm peak, Southern New England Ultraviolet Co.) at a distance of about 12.5 cm so that
irradiance was ~6.5 mW/cm2 and irradiated for 60-120 minutes.
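
The delivered light dose follows from the irradiance and exposure time given above. The sketch below is a simple unit conversion, assuming constant irradiance over the exposure; the function name is hypothetical.

```python
def dose_j_per_cm2(irradiance_mw_per_cm2, minutes):
    """Delivered dose in J/cm2 = irradiance (W/cm2) x exposure time (s).

    Assumes the irradiance stays constant for the whole exposure.
    """
    return irradiance_mw_per_cm2 / 1000.0 * minutes * 60.0


for minutes in (60, 120):
    dose = dose_j_per_cm2(6.5, minutes)
    print(f"{minutes} min at 6.5 mW/cm2 -> {dose:.1f} J/cm2")
```

Under this assumption, the 60-120 minute exposures correspond to roughly 23-47 J/cm2, which lies within the 20-150 J/cm2 range stated earlier for HMT at approximately 419 nm.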

Removal of Low Molecular Weight Photoproducts

100 µl of chloroform-isoamyl alcohol (24:1) was pipetted in and mixed by vortex. The
mixture was centrifuged for 5 minutes at 15,000 xg in a microcentrifuge tube. The
chloroform-isoamyl alcohol layer was removed with a micropipet. The chloroform-isoamyl
alcohol extraction was repeated once again. Clean RNA was precipitated out of
the solution.

Alcohol Precipitation

Two volumes (~1000 µl) of ice cold absolute ethanol were added to the mixture. The
tube was centrifuged for 15 minutes at 15,000 xg in a microcentrifuge. The supernatant was
decanted and discarded and the precipitated RNA was redissolved in 100 µl DEPC treated
water then re-exposed to the RNA+8-MOP.

Isolation of the Psoralenated RNA Fragments Using HPLC

All components, glassware and reagents were prepared so that they were RNAase
free. The HPLC was set up with a Dionex DNA PA-100 packed column. The
psoralenated RNA:DNA hybrid was warmed to 4°C. The psoralenated RNA was applied to
HPLC followed by oligonucleotide analysis, as described in the following section entitled
"Oligonucleotide Analysis by HPLC." The collected fractions represented:

• 5'CUAGAΨCUGGAGG3', where Ψ is pseudouridine (SEQ ID NO: 1)

• Furan sided 5'CUPsoralenAGAΨCUGGAGG3' monoadducts (SEQ ID NO: 2)

• 5'XXXXXCCUCCAGAUCUAGXXXXX3' (SEQ ID NO: 3)

• 5'XXXXXCCUCCAGAUCUPsoralenAGXXXXX3' (SEQ ID NO: 4)

The fractions were stored at 4°C in new, RNAase free snap capped microcentrifuge
tubes and stored at -20°C if more than four weeks of storage were required.

Identification of the RNA Fragments Represented by Each Peak Fraction Collected by
HPLC Using Polyacrylamide Gel Electrophoresis (PAGE)

The electrophoresis unit was set up in a 4°C refrigerator. A gel was selected with a
2 mm spacer. 5 µl of each HPLC fraction was diluted to 10 µl with Loading Buffer. 10 µl
of each diluted fraction was loaded into appropriately labeled sample wells. The tracking
dye was loaded in a separate lane and electrophoresis was run as described in the following
section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA
Fragments." The electrophoresis was stopped
when the tracking dye reached the edge of the gel. The apparatus was disassembled. The
gel-glass panel unit was placed on the UV light box. UV lights were turned on. The RNA
bands were identified. The bands appeared as denser shadows under UV lighting
conditions.

Extraction of the RNA From the Gel

Each band was excised with a new sterile and RNAase free scalpel blade and
transferred into a new 1.5 ml snap capped microcentrifuge tube. Each gel band was crushed
against the wall of the microcentrifuge tube with the side of the scalpel blade. A new
blade was used for each sample. 1.0 ml of 0.3M sodium acetate was added to each tube and
the RNA was eluted for at least 24 hours at 4°C. The eluate was transferred to a new 0.5 ml
snap capped polypropylene microcentrifuge tube with a micropipet. A new RNAase free
pipette tip was used for each tube and the RNA was precipitated out with ethanol.

Ethanol Precipitation 
Two volumes of ice cold ethanol were added to each eluate, which was then centrifuged at
15,000 xg for 15 minutes in a microcentrifuge. The supernatants were discarded and the
precipitated RNA was re-dissolved in 100 µl of DEPC treated DI water. The RNA was
stored in the microcentrifuge tubes at 4°C until needed. The tubes were stored at -20°C if
storage was for more than two weeks. The following was the order of rate of migration for
each fragment, from fastest to slowest:

a) 5'CUAGAΨCUGGAGG3' (SEQ ID NO: 1)

b) Furan sided 5'CUPsoralenAGAΨCUGGAGG3' monoadducts (SEQ ID NO: 2)

c) 5'XXXXXCCUCCAGAUCUAGXXXXX3' (SEQ ID NO: 3)

d) 5'XXXXXCCUCCAGAUCUPsoralenAGXXXXX3' (SEQ ID NO: 4)

The tubes containing the remainder of each fraction were labeled and stored at -20°C.

ETHANOL PRECIPITATION 

RNA oligonucleotide fragments were precipitated, and all glassware was cleaned to 
remove any traces of RNase as described in the following section entitled "Inactivation of 
RNases on Equipment, Supplies, and in Solutions." All solutions were stored in RNAase
free glassware and introduction of nucleases was prevented. Absolute ethanol was stored at
0°C until used. Micropipettors were used to add two volumes of ice cold ethanol to nucleic
acids that were to be precipitated in microcentrifuge tubes. Capped microcentrifuge tubes
were placed into the microfuge and spun at 15,000 xg for 15 minutes. The supernatant was
discarded and precipitated RNA was re-dissolved in DEPC treated DI water. RNA was
stored at 4°C in microcentrifuge tubes until ready to use.

LIGATION OF RNA FRAGMENTS 2 AND 3 

All glassware was cleaned to remove any traces of RNase as described in the 
following section entitled "Inactivation of RNases on Equipment, Supplies, and in 
Solutions." The following was added to a new 1.5 ml polypropylene snap capped
microcentrifuge tube using a 100-1000 µl pipet, and a new sterile pipet tip was used for
each solution:

Fragment 2 (3.0 nM)             125.0 µl
Fragment 3 (3.0 nM)             125.0 µl
Reaction buffer                 250.0 µl
T4 RNA ligase (9-12 U/ml)       42 µl

Reaction Buffer

RNase free DI water             90.00 ml
Tris-HCl (50 mM)                0.79 g
MgCl2 (10 mM)                   0.20 g
DTT (5 mM)                      0.078 g
ATP (1 mM)                      0.55 g
pH to 7.8 with HCl
RNase free DI water             QS to 100.00 ml
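
The weighed amounts for a buffer of this kind follow from mass = molarity x molecular weight x volume. The sketch below is illustrative only. The molecular weights are assumptions not stated in the source (Tris-HCl 157.6 g/mol, MgCl2 hexahydrate 203.3 g/mol, DTT 154.25 g/mol), and ATP is left out because its molecular weight depends on the salt form used.

```python
def grams_needed(molarity_mol_per_l, mol_weight_g_per_mol, volume_ml):
    """Mass of solute (g) for a target molarity in a given final volume."""
    return molarity_mol_per_l * mol_weight_g_per_mol * volume_ml / 1000.0


# Assumed molecular weights in g/mol; these values are not given in the source.
MOL_WEIGHTS = {"Tris-HCl": 157.6, "MgCl2 hexahydrate": 203.3, "DTT": 154.25}
# Target concentrations in mol/l, taken from the reaction buffer listing.
TARGETS = {"Tris-HCl": 0.050, "MgCl2 hexahydrate": 0.010, "DTT": 0.005}

for name, conc in TARGETS.items():
    grams = grams_needed(conc, MOL_WEIGHTS[name], 100.0)
    print(f"{name}: {grams:.3f} g per 100 ml")
```

If a different hydrate or salt form is used, the molecular weight in the dictionary would need to be changed accordingly.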



The mixture was gently mixed and the RNA was melted by incubating the mixture 
at 16°C for one hour in a temperature controlled refrigerated chamber. RNA was 
precipitated out of the solution immediately after the incubation was completed. 

Alcohol Precipitation

Two volumes (~1000 µl) of ice cold absolute ethanol were added to the reaction
mixture. The microcentrifuge tube was placed in a microcentrifuge at 15,000 xg for 15
minutes. The supernatant was decanted and discarded and the precipitated RNA was
re-dissolved in 100 µl DEPC treated water. The mixture was electrophoresed as described in
the following section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated
RNA Fragments." The following was the order of rate of migration for each fragment,
from fastest to slowest:

a) Frag. 2
5'CUAGAΨCUGGAGG3'-OHPsoralen (SEQ ID NO: 5)

b) Frag. 3
5'UCCUGUGTΨCGAUCCACAGAAUUCGCACC-Puromycin (SEQ ID NO: 6)

c) Frag. 2+3
5'CUPsoralenAGAΨCUGGAGGUCCUGUGTΨCGAUCCACAGAAUUCGCACC-Puromycin (SEQ ID NO: 7)

Each fraction was isolated by UV shadowing, the bands were cut out, the RNAs
were eluted from the gels and the RNA eluate was precipitated out as described in the
following section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated
RNA Fragments." The ligation procedure was repeated with any residual unligated
fragment 2 and 3 fractions. The ligated fractions 2 and 3 were pooled and stored in a small
volume of RNase free DI water at 4°C.

LIGATION OF RNA FRAGMENT 1 WITH FRAGMENT 2+3

All glassware was cleaned to remove any traces of RNase as described in the 
following section entitled "Inactivation of RNases on Equipment, Supplies, and in 
Solutions." The following was added to a new 1.5 ml polypropylene snap capped 
microcentrifuge tube. A 100-1000 µl pipet and new tip were used for each solution:

Fragment 2+3 (3.0 nM)                     125.0 µl
Reaction buffer                           250.0 µl
T4 Polynucleotide Kinase (5-10 U/ml)      1.7 µl

Reaction Buffer

RNase free DI water             90.00 ml
Tris-HCl (40 mM)                0.63 g
MgCl2 (10 mM)                   0.20 g
DTT (5 mM)                      0.08 g
ATP (1 mM)                      0.006 g
pH to 7.8 with HCl
RNase free DI water             QS to 100.00 ml
The RNA was gently mixed then melted by heating the mixture to 70°C for 5
minutes in a heating block. The mixture was cooled to room temperature over a two hour
period and the RNA was allowed to anneal in a tRNA configuration. The RNA was
precipitated out of the solution.

Alcohol Precipitation 

Two volumes (~1000 µl) of ice cold absolute ethanol were added to the reaction
mixture. The microcentrifuge tube was placed in a microcentrifuge at 15,000 xg for 15
minutes. The supernatant was decanted and discarded and the precipitated RNA was
re-dissolved in 100 µl DEPC treated water. The mixture was electrophoresed as described in
the following section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated
RNA Fragments." The following was the order of rate of migration for each fragment,
from fastest to slowest:

a) Frag. 1
5'GCGGAUUUAGCUCAGDDGGGAGAGCGCCAGACU3' (SEQ ID NO: 8)

b) Frag. 2+3
5'CUPsoralenAGAΨCUGGAGGUCCUGUGTΨCGAUCCACAGAAUUCGCACC-Puromycin (SEQ ID NO: 6)

c) Frag. 1+2+3
5'GCGGAUUUAGCUCAGDDGGGAGAGCGCCAGACUCUPsoralenAGAΨCUGGAGGUC...CUGUGTΨCGAUCCACAGAAUUCGCACC-Puromycin (SEQ ID NO: 9)



Each fraction was isolated by UV shadowing, the bands were cut out, the RNAs 
were eluted from the gels and the RNA elute was precipitated out as described in the 
following section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated 
RNA Fragments." The ligation procedure was repeated with the unligated Fragment 1 and
the 2+3 Fraction. The ligated fractions 2+3 were pooled and stored in a small volume of
RNase free DI water at 4°C.

Final RNA Ligation 

The following was added to a new 1.5 ml polypropylene snap capped
microcentrifuge tube. A 100-1000 µl pipet and new tip were used for each solution:

Fragment 1+2+3 (3.0 nM)         250 µl
Reaction buffer                 250 µl
T4 RNA ligase (44 µg/ml)        22 µg

The mixture was incubated at 17°C in a temperature controlled refrigerator for 4.7
hours. Immediately after the incubation the tRNA was precipitated out as described in step
6.2 above and the tRNA was isolated by electrophoresis as described in the following
section entitled "Polyacrylamide Gel Electrophoresis (PAGE) of Psoralenated RNA
Fragments." The tRNA was pooled in a small volume of RNase free water and stored at
4°C for up to two weeks or stored at -20°C for periods longer than two weeks.

POLYACRYLAMIDE GEL ELECTROPHORESIS (PAGE) OF PSORALENATED 
RNA FRAGMENTS 

Acrylamide Gel Preparation 

All reagents and glassware were made RNAase free as described in the following
section entitled "Inactivation of RNases on Equipment, Supplies, and in Solutions." The
gel apparatus was assembled to produce a 4 mm thick by 20 cm x 42 cm square gel. 29
parts acrylamide with 1 part ammonium crosslinker were mixed at room temperature with
the appropriate amount of acrylamide solution in an RNAase free, thick walled Erlenmeyer
flask.

Acrylamide Solution

Urea (7 M)                      420.42 g
TBE (1X)                        QS to 1 L

5X TBE

0.455 M Tris-HCl                53.9 g
10 mM EDTA                      20 ml of 0.5 M
RNAase free DI water            900 ml
pH with boric acid to pH 9
QS with RNAase free DI water to 1 L



The mixture was degassed with vacuum pressure for one minute. The appropriate 
amount of TEMED was added, mixed gently, and then the gel mixture was poured between 
the glass plates to within 0.5 cm of the top. The comb was immediately inserted between 
the glass sheets and into the gel mixture. An RNAase free gel comb was used. The comb
produced wells for a 5 mm wide dye lane and 135 mm sample lanes. The gel was allowed 
to polymerize for about 30-40 minutes then the comb was carefully removed. The sample 
wells were rinsed out with a running buffer using a micropipet with a new pipet tip. The 
wells were then filled with running buffer. 

Sample Preparation

An aliquot of the sample was suspended in loading buffer in a snap capped
microcentrifuge tube and vortex mixed. Indicator dye was not added to the sample.

Loading Buffer

Urea (7 M)                      420.42 g
Tris-HCl (50 mM)                7.85 g
QS with RNAase free DI H2O to 1 L



Electrophoresis run 

The maximum volume of RNA/loading buffer solution was loaded into the 135 mm
sample wells and the appropriate volume of tracking dye into the 5 mm tracking lane. The
samples were electrophoresed in a 5°C refrigerator. The electrophoresis was stopped when
the tracking dye reached the edge of the gel. The apparatus was then disassembled. Glass
panels were not removed from the gel. The gel-glass panel unit was placed on a UV light
box. With UV filtering goggles in place, the UV lights were turned on. The RNA bands
were identified. They appeared as denser shadows under UV lighting conditions. The
RNA was extracted from the gel. Each band was excised with a new sterile and RNAase



WO 03/044194 



PCT/US02/37103 



free scalpel blade and each band was transferred into a new 1.5 ml snap capped
microcentrifuge tube. Each gel band was crushed against the wall of the microcentrifuge tube
with the side of the scalpel blade. A new blade was used for each sample. 1.0 ml of 0.3M
sodium acetate was added to each tube and the RNA was eluted for at least 24 hours at 4°C.
The eluate was transferred to a new 0.5 ml snap capped polypropylene microcentrifuge tube
with a micropipet, using a new RNAase free pipet tip for each tube. Two volumes of ice cold
ethanol were added to each eluate, which was then centrifuged at 15,000 xg for 15 minutes
in a microcentrifuge. The supernatants were discarded and the precipitated RNA was
redissolved in 100 µl of DEPC treated DI water. The RNA was stored in the
microcentrifuge tubes at 4°C until needed.

OLIGONUCLEOTIDE ANALYSIS BY HPLC 

HPLC purification of the RNA oligonucleotides was performed using anion
exchange chromatography. Either the 2'-protected or 2'-deprotected forms may be
chromatographed. The 2'-protected form offered the advantage of minimizing secondary
structure effects and providing resistance to nucleases. If the RNA was fully deprotected,
sterile conditions were required during purification.

One skilled in the art will understand that the HPLC purification methods of
Example 2 may be modified in order to purify the RNA oligonucleotides. Modification of
the HPLC purification methods of Example 2, including HPLC gradient, temperature, and
other parameters, may be necessary. One of skill in the art would also recognize that a
one-step HPLC purification method may also be used in accordance with several embodiments
of the current invention.

INACTIVATION OF RNASES ON EQUIPMENT, SUPPLIES, AND IN 
SOLUTIONS 

Glassware was treated by baking at 180°C for at least 8 hours. Plasticware was
treated by rinsing with chloroform. Alternatively, all items were soaked in 0.1% DEPC.

Treatment with 0.1% DEPC

0.1% DEPC was prepared. DI water was filtered through a 0.2 µm membrane filter.
The water was autoclaved at 15 psi for 15 minutes on a liquid cycle. 1.0 g (wt/v)
DEPC/liter of sterile filtered water was added.






Glass and Plasticware 

All glass and plasticware was submerged in 0.1% DEPC for two hours at 37°C. 
The glassware was rinsed at least 5X with sterile DI water. The glassware was heated to 
100°C for 15 minutes or autoclaved for 15 minutes at 15 psi on a liquid cycle.

Electrophoresis Tanks Used for Electrophoresis of RNA

Tanks were washed with detergent, rinsed with water then ethanol and air dried.
The tank was filled with 3% (w/v) hydrogen peroxide (30 ml/L) and left standing for 10
minutes at room temperature. The tank was rinsed at least 5 times with DEPC treated
water.

Solutions 

All solutions were made using RNase free glassware, plasticware, autoclaved water,
chemicals reserved for work with RNA and RNase free spatulas. Disposable gloves were
used. When possible, the solutions were treated with 0.1% DEPC for at least 12 hours at
37°C and then heated to 100°C for 15 minutes or autoclaved for 15 minutes at 15 psi on a
liquid cycle.

RNA TRANSLATION 

2 µl of gastroinhibitory peptide (GIP) mRNA at a concentration of 20 µg/ml was
placed in a 250 µl snapcap polypropylene microcentrifuge tube. 35 µl of rabbit reticulocyte
lysate (available commercially from Promega) was added. 1 µl of amino acid mixture
which did not contain methionine (available commercially from Promega) was added. 1 µl
of 35S methionine or unlabeled methionine was added. Optionally, 2 µl of luciferase may be
added to some tubes to serve as a control. In a preferred embodiment, luciferase was used
instead of GIP mRNA. One skilled in the art will understand that indeed any mRNA
fragment containing the appropriate sequences may be used.

SATA was added to the experimental tubes. Control tubes which did not contain
SATA were also prepared. The quantity of SATA used was approximately between 0.1 µg
and 500 µg, preferably between 0.5 µg and 50 µg. 1 µl of Rnasin at 40 units/µl was added.
Nuclease free water was added to make a total volume of 50 µl.
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For proteins greater than approximately 150 amino acids, the amount of tRNA 
may need to be supplemented. For example, approximately 10 - 200 pg of tRNA may be 
added. In general, the quantity of the SATA should be high enough to effectively suppress 
stop or pseudo stop codons. The quantity of the native tRNA must be high enough to out 
5 compete the SATA which does not undergo dynamic proofreading under the action of 

elongation factors. 
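
The reaction assembly described above can be checked with a short calculation. The following is a minimal sketch, not part of the described method: the function name `water_to_final_volume` is hypothetical, and the SATA and any supplemental tRNA volumes are not stated in the text, so they are left for the user to add.

```python
def water_to_final_volume(component_volumes_ul, final_volume_ul=50.0):
    """Return the nuclease-free water (µl) needed to reach the final volume."""
    used = sum(component_volumes_ul.values())
    if used > final_volume_ul:
        raise ValueError(
            f"components total {used} µl, which exceeds {final_volume_ul} µl"
        )
    return final_volume_ul - used


# Volumes quoted in the text (µl). The SATA volume is NOT given in the text;
# add it (and any supplemental tRNA) to this dictionary before use.
components_ul = {
    "GIP mRNA": 2.0,
    "rabbit reticulocyte lysate": 35.0,
    "amino acid mixture minus methionine": 1.0,
    "methionine (35S-labeled or unlabeled)": 1.0,
    "Rnasin": 1.0,
}

water_ul = water_to_final_volume(components_ul)
print(f"Water to add before SATA is accounted for: {water_ul} µl")
```

With only the listed components, 40 µl of the 50 µl is occupied, so at most 10 µl remains for SATA, supplemental tRNA and water combined.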

Each tube was immediately capped, parafilmed and incubated for the translation
reactions at 30°C for 90 minutes. The contents of each reaction tube were transferred into a
50 µl quartz capillary tube by capillary action. The SATA was crosslinked with mRNA by
illuminating the contents of each tube with 2-10 J/cm2 of ~350 nm wavelength light, as per
Gasparro et al. (Photochem. Photobiol. 57:1007 (1993), herein incorporated by reference).
Following photocrosslinking, the contents of each tube were transferred into a new snap-cap
microfuge tube. The ribosomes were dissociated by chelating the calcium cations by
adding 2 µl of 10 mM EDTA to each tube. Between each step, each tube was gently mixed
by stirring each component with a pipette tip upon addition.
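
The 2-10 J/cm2 dose can be converted into an exposure time once the lamp irradiance at the sample is known. The sketch below assumes an irradiance of 1 mW/cm2 purely for illustration; the text does not state the irradiance used for this crosslinking step, and the function name `exposure_time_s` is hypothetical.

```python
def exposure_time_s(dose_j_per_cm2, irradiance_mw_per_cm2):
    """Seconds of illumination needed to deliver a dose at a given irradiance.

    dose [J/cm2] = irradiance [W/cm2] x time [s]
    """
    if irradiance_mw_per_cm2 <= 0:
        raise ValueError("irradiance must be positive")
    return dose_j_per_cm2 * 1000.0 / irradiance_mw_per_cm2


# Assumed value for illustration only; measure the real irradiance at the sample.
ASSUMED_IRRADIANCE_MW_PER_CM2 = 1.0

for dose in (2.0, 10.0):
    seconds = exposure_time_s(dose, ASSUMED_IRRADIANCE_MW_PER_CM2)
    print(f"{dose} J/cm2 -> {seconds / 60:.1f} min")
```

A brighter source shortens the exposure proportionally, so the assumed irradiance should be replaced with a measured value.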

The optimal RNA concentration for translation was determined prior to performing
definitive experiments. Serial dilutions may be required to find the optimal concentration
of mRNA between 5-20 µg/ml.
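
One way to cover the 5-20 µg/ml range is a two-fold dilution series. The sketch below is illustrative only: the two-fold step is an assumption (the text does not specify a dilution factor), and `twofold_series` is a hypothetical helper name.

```python
def twofold_series(start_ug_per_ml, lowest_ug_per_ml):
    """List concentrations from start down to lowest, halving at each step."""
    if start_ug_per_ml <= 0 or lowest_ug_per_ml <= 0:
        raise ValueError("concentrations must be positive")
    concentrations = []
    current = start_ug_per_ml
    while current >= lowest_ug_per_ml:
        concentrations.append(current)
        current /= 2.0
    return concentrations


# Range taken from the text; the dilution factor is an assumption.
series = twofold_series(20.0, 5.0)
print(series)
```

A finer step (for example 1.5-fold) could be substituted if the optimum needs to be located more precisely.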
Reagent                                               1    2    3    4
Rabbit reticulocyte lysate (35 µl)                    +    +    +    +
Amino acid mixture minus methionine (1 µl of 1 mM)    +    +    +    +
35S Methionine (1 µl of 1,200 Ci/mmol)                +              +
Methionine (unlabeled)                                     +    +
GIP mRNA (2 µl of 20 µg/ml)                           +
32P GIP mRNA (2 µl of 20 µg/ml)                            +    +
Rnasin (1 µl of 40 U/µl)                              +    +    +
SATA
Water, nuclease free (q.s. to 50 µl)                       +    +    +

SDS-PAGE electrophoresis was performed on each sample, as described above.
Autoradiography on the gel was performed, as described by Sambrook et al., Molecular
Cloning, A Laboratory Manual, 2nd ed., Cold Spring Harbor Press (1989), herein
incorporated by reference.

In another example, the SATA was produced in a manner similar to the above 
methodology, except that uridines were substituted with pseudouridines. This technique is 
discussed below in Example 2. 

EXAMPLE 2: PRODUCTION OF THE SATA USING PSEUDOURIDINE

As discussed above, one skilled in the art will appreciate that the SATA can be
produced in a number of different ways. Figure 5 shows the chemical structures for uridine
and pseudouridine. Pseudouridine is a naturally occurring base found in tRNA that forms
hydrogen bonds just as uridine does, but lacks the 5-6 double bond that is the target for
psoralen. In a preferred embodiment, the SATA was produced using pseudouridine.
Specifically, in a preferred embodiment, three fragments (Fig. 1) were purchased from a
commercial source (Dharmacon Research Inc., Boulder, CO). Modified bases and a
fragment 3 ("Fragment 3") with a pre-attached puromycin on its 3' end and a PO4 on its 5'
end were included, all of which are available commercially. The three fragments were used
to facilitate manipulation of a fragment 2 ("Fragment 2") in forming the monoadduct.
Preferred sequences of the three fragments are as follows:
Fragment 1 

5 STO^CGGAUUUAGCUCAGUUGGGAGAGCGCCAGACOrB' (SEQ ID NO: 10) 

Fragment 2 
S'OHNKCUAACTCOHS' (SEQID NO: 11) 

Fragment 3

S'PO.UGGAGGUCCUGUGdTH'CGAUCCACAGAAUUCGCACCPuromycinS' (SEQ ID 
NO: 12) 

Yeast tRNAAla or yeast tRNAPhe was preferably used. However, one skilled in
the art will understand that sequences can be chosen widely from known tRNAs or by
selecting sequences that will form into a tRNA-like structure. Using pseudouridine instead
of uridine in Fragment 2 avoids psoralen labeling of the nontarget U's.

Fragment 2 was preferably used in a helical conformation to induce the psoralen to
intercalate. Accordingly, a complementary strand was required. RNA or DNA was used,
and a sequence, such as poly C, was added to one or both ends to facilitate separation and
removal after monoadduct formation was accomplished. Use of pseudouridine instead of
uridine in the complement means that a high-efficiency wavelength, such as 365 nm, can
be used without fear of crosslinking the product. Irradiation was preferably in the range of
about 300-450 nm, more preferably in the range of about 320 to 400 nm, and most
preferably about 365 nm. Further, use of pseudouridine left the furan-sided monoadduct
(MAf) in place on Fragment 2 because the MAf is the predominant first step in the crosslink
formation.

The following cRNA sequence with pseudouridine was used according to a
preferred embodiment of the present invention:
CCCTCCAGAGT^AGACCC (SEQ ID NO: 13)

Step 1: Furan-sided monoadduction of psoralen to Fragment 2

The formation of a furan-sided psoralen monoadduct with the target uridine of
Fragment 2 was performed as follows:

A reaction buffer was prepared as follows:
Tris-HCl    25 mM
NaCl        100 mM
EDTA        0.32 mM
pH          7.0

4'-Hydroxymethyl-4,5',8-trimethylpsoralen (HMT) was then added to a final
concentration of 0.32 mM and equimolar amounts of Fragment 2 and cRNA were added to a
final molar ratio of Fragment 2 : cRNA : psoralen = 1:1:1000. A total volume of 100 µl was
irradiated at a time.

The mixture of complementary oligos and HMT psoralen was processed as follows:

1) Heated to 85°C for 60 sec followed by cooling to 4°C over 15 min, using a PCR
thermocycler.

2) Irradiated for 20 min at 4°C in an Eppendorf UVette plastic cuvette, with the top
covered with parafilm, laid on top of a UV lamp (1 mW/cm2 multi-wavelength UV lamp
(λ > 300 nm); UV L21 model, λ 365 nm).

Steps 1 and 2 above were repeated 4 times to re-intercalate and irradiate HMT.
After the second irradiation, an additional 10 µl of 1.6 mM HMT was added to the 100 µl
total reaction volume. After 4 cycles of irradiation, the free psoralens were extracted with
chloroform and all oligos (labeled and unlabeled) were precipitated with ethanol overnight
(see precipitation step). A small aliquot was saved for gel identification.

Step 2: Purification of HMT-conjugated Fragment 2 (2MA) oligo by reverse-phase HPLC

1) The reaction mixture was dried with speed vacuum for 10 minutes and then
was dissolved with 2 µl of 0.1 M TEAA, pH 7.0 buffer, prepared as follows:

Acetic acid          5.6 ml
Triethylamine        13.86 ml
H2O (RNase-free)     950 ml
pH adjusted to 7.0 with acetic acid
and water added to 1 L
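
As a rough consistency check on the TEAA recipe, the triethylamine volume can be converted to a molar concentration. The sketch below is illustrative: the density and molar mass of triethylamine are standard reference values supplied here as assumptions (they do not appear in the text), and `molarity_from_volume` is a hypothetical helper name.

```python
# Reference values for triethylamine; assumptions, not taken from the text.
TEA_DENSITY_G_PER_ML = 0.726        # approximate, near room temperature
TEA_MOLAR_MASS_G_PER_MOL = 101.19


def molarity_from_volume(volume_ml, density_g_per_ml, molar_mass_g_per_mol,
                         final_volume_l):
    """Molar concentration of a neat liquid diluted to a final volume."""
    moles = volume_ml * density_g_per_ml / molar_mass_g_per_mol
    return moles / final_volume_l


teaa_molar = molarity_from_volume(
    13.86, TEA_DENSITY_G_PER_ML, TEA_MOLAR_MASS_G_PER_MOL, 1.0
)
print(f"Triethylamine concentration: {teaa_molar:.3f} M")
```

Under these assumptions the recipe comes out very close to the nominal 0.1 M.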
2) The sample was loaded onto a Waters Xterra MS C18, 2.5 µm, 4.5x50 mm
reverse-phase column pre-equilibrated with buffer A (5% wt/wt acetonitrile in 0.1 M TEAA,
pH 7.0). The sample was eluted with a gradient of 0-55% buffer B (15% wt/wt acetonitrile
in 0.1 M TEAA, pH 7.0) in buffer A over a 35 minute time frame at a flow rate of 1
ml/minute. The column temperature was 60°C and the detection wavelength, set by a
narrow band filter, was 340 nm. Furan-sided psoralen monoadduct absorbs at 340 nm but
the RNA and any pyrone-sided monoadduct do not. The buffer solutions were filtered
and degassed before use.

The 2MA eluted at around 25-28 minutes at a buffer B concentration of 40%.
Unpsoralenated fragment 2 eluted before 8 minutes based on subsequent gel electrophoresis
analysis on collected fractions.
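
The buffer B percentage at elution can be translated into an approximate acetonitrile content. This is a minimal sketch that treats the blend of buffer A (5% acetonitrile) and buffer B (15% acetonitrile) as linear in the pump percentage, which is a simplifying assumption; `acetonitrile_percent` is a hypothetical helper name.

```python
def acetonitrile_percent(percent_b, acn_in_a=5.0, acn_in_b=15.0):
    """Approximate acetonitrile content (%) of a blend of buffers A and B.

    Assumes the mobile phase composition is linear in the %B delivered.
    """
    if not 0.0 <= percent_b <= 100.0:
        raise ValueError("percent_b must be between 0 and 100")
    fraction_b = percent_b / 100.0
    return (1.0 - fraction_b) * acn_in_a + fraction_b * acn_in_b


# 40% B is the elution point quoted for the 2MA; 0% and 55% bound the gradient.
for pct_b in (0.0, 40.0, 55.0):
    print(f"{pct_b:.0f}% B -> about {acetonitrile_percent(pct_b):.1f}% acetonitrile")
```

On this basis the 2MA elutes at roughly 9% acetonitrile, and the gradient as a whole spans about 5% to 10.5%.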

The column was washed with 100% acetonitrile for 5 minutes and was
re-equilibrated with buffer A for 15 minutes. All fractions were dried with speed vacuum
overnight.

The fractions containing the 2MA were identified by the level of absorbance at 260
nm (RNA) and 330 nm (furan-sided psoralen monoadducted RNA). This was done by
redissolving the dried fractions with 120 µl of RNase-free distilled water and measuring
the absorbance with a spectrophotometer at 260 nm and 330 nm. The fractions with high
absorbance at both wavelengths were pooled and then dried with speed vacuum. A small
aliquot from each was saved for gel analysis.

The cross-linked products were analyzed on a denaturing 20% TBE-urea gel and 
visualized by gel silver staining. 

Step 3: Purification of HMT-conjugated fragment 2 oligo from cRNA by anion exchange
HPLC

The dried samples were pooled and then were dissolved with 0.5X TE buffer. A
sample of about 0.4 absorbance unit was loaded onto a Dionex DNAPac PA-100
(4x250 mm) column which was pre-equilibrated with buffer C (25 mM Tris-HCl, pH 8.0)
and the column temperature was 85°C.

The oligos were eluted at a flow rate of 1 ml/min with a concave gradient from 4%
to 55% buffer D for 15 minutes followed by a convex gradient from 55% to 80%
buffer D for the next 15 minutes. The column was washed with 100% buffer D for 5 min
and 100% buffer C for another 5 min at a flow rate of 1.5 ml/min. Fractions were collected
that absorbed 260 nm light. 2MA had a retention time (RT) of 16.2 minutes and was eluted
by 57% buffer D, free fragment 2 had RT less than 16.6 minutes and was eluted by
55% buffer D, and free cRNA had RT greater than 19.2 minutes. The fractions were
collected that absorbed at 254 or 260 nm. The collected fractions were dried with speed
vacuum overnight. All solutions were filtered and degassed before use.
The solutions used comprised the following:
C: 25 mM Tris-HCl pH 8.0;
D: 250 mM NaClO4 in 25 mM Tris pH 8.0 buffer.
TE: 10 mM Tris-HCl pH 8.0 with 1 mM EDTA

Step 4: Desalting, precipitation and collection of the purified 2MA oligo 

The dried fractions were redissolved with 100 µl RNase-free distilled water. 500 µl of
cool 100% ethanol with 0.5 M (NH4)2CO3 was added and the mixture was vortexed
briefly. The mixture was then frozen on dry ice for 60 minutes or stored at
-20°C overnight.

The samples were then brought to 4°C and centrifuged at maximum speed in a
microcentrifuge for 15 minutes. The position of the pellet was noted and the supernatant
was decanted or removed by pipette. Care was taken not to disturb the pellet. If the pellet still
contained salt, this step was repeated. The pellet was then washed with 70% pre-cooled
ethanol twice. The wet pellet was dried with speed vacuum for 15 min.
Urea PAGE was used to identify the correct fractions for the next step.

Step 5: Ligation of 2MA oligo to Fragment 3 oligo:

A. The following reagents and instruments were used:

• Nuclease-Free Water (Promega)
• polyethylene glycol (PEG8000, Sigma), 40% (wt/wt in water)
• RNasin ® Ribonuclease Inhibitor (Promega)
• phenol:chloroform
• 1.5 ml sterile microcentrifuge tubes
• 100% ethanol
• 70% ethanol
• Dry ice or -20°C freezer
• Microcentrifuge at room temperature and +4°C
• PCR thermocycler or water bath

B. The following reaction conditions were used:

• 50 mM Tris-HCl (pH 7.8),
• 10 mM MgCl2,
• 10 mM DTT,
• 1 mM ATP and
• 18-20% PEG

C. The following reaction mixture was assembled in a sterile microcentrifuge tube:

• Fragment 3 (Donor) 1 µl (6 µg) (purified, when necessary, before using as a donor)
• 2MA (Acceptor) 1 µl (1.5 µg)

After adding 8 µl of RNase-free dH2O, the reactions were incubated at 85°C for 1
minute to relax the oligo secondary structure, then slowly cooled to 4°C using a PCR
thermocycler. The preheated tube was placed on ice to keep cool and centrifuged
briefly, then the following was added:

• 10X Ligase Buffer 4 µl
• 10 mM ATP 4 µl
• RNase Out or Rnasin (40 u/µl) (Promega) 0.5 µl
• PEG, 40% (Sigma) 20 µl
• T4 RNA Ligase (10 u/µl) (NEB) 1 µl

Nuclease-free water was added to a final volume of 40 µl. The mixture was incubated at
16°C overnight (16 hr). The mixture was centrifuged briefly and then was placed on ice.
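
The stock volumes in the ligation mixture can be checked against the stated reaction conditions with the usual C1·V1 = C2·V2 dilution relation. The sketch below is illustrative; `final_concentration` is a hypothetical helper name, and the ATP figure counts only the separately added ATP, not any ATP already present in the 10X buffer.

```python
def final_concentration(stock_concentration, stock_volume_ul, total_volume_ul):
    """Concentration after dilution, in the same units as the stock (C1*V1 = C2*V2)."""
    if total_volume_ul <= 0:
        raise ValueError("total volume must be positive")
    return stock_concentration * stock_volume_ul / total_volume_ul


TOTAL_UL = 40.0  # final ligation volume from the text

peg_percent = final_concentration(40.0, 20.0, TOTAL_UL)   # 40% PEG stock, 20 µl
atp_mm = final_concentration(10.0, 4.0, TOTAL_UL)         # 10 mM ATP stock, 4 µl
buffer_x = final_concentration(10.0, 4.0, TOTAL_UL)       # 10X buffer, 4 µl

print(f"PEG: {peg_percent}%  ATP (added): {atp_mm} mM  Buffer: {buffer_x}X")
```

These values fall within the 18-20% PEG and 1 mM ATP conditions listed in part B.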

D. Precipitation of Oligonucleotides:

60 µl of DEPC-treated RNase-free distilled water was added to the mixture and then 150 µl
phenol/chloroform was added. The mixture was vortexed vigorously for 30 seconds. The
precipitate was then centrifuged out at maximum speed in a microcentrifuge for 5 minutes at
room temperature. The aqueous phase was transferred to a new microcentrifuge tube
(>95 µl).

To this was added 3 µl of 5 mg/ml glycogen and 500 µl of pre-cooled 100% ethanol with
0.5 M (NH4)2CO3, and the mixture was vortexed briefly and then was frozen on dry ice for
60 minutes. At this point, it may be stored overnight at -20°C. The dried fractions were
redissolved with 100 µl RNase-free distilled water, and 500 µl cool 100% ethanol with 0.5 M
(NH4)2CO3 was added and vortexed briefly. This was then frozen on dry ice for 60
minutes or stored at -20°C overnight. The samples were then brought to 4°C and
centrifuged at maximum speed in a microcentrifuge for 15 minutes and the supernatant
was removed by pipette. Care was taken not to disturb the pellet. If the pellet still contained
salt, this step was repeated once. The pellet was then washed with 70% pre-cooled ethanol
several times. This was then centrifuged at maximum speed in a microcentrifuge for 5
minutes at 4°C. The ethanol was carefully removed using a pipette. Centrifugation was
repeated to collect the remaining ethanol, which was carefully removed. The wet pellet
was dried with speed vacuum for 10 min. A small aliquot was collected for the gel analysis.

For long term storage, the RNA was stored in ethanol at -20°C. Care was taken not to store
the RNA in DEPC water.

Step 6: Purification of the ligated fragment 3 oligo complex

The dried sample was redissolved with 0.5X TE buffer and was loaded onto a
DNAPac PA-100 column which was equilibrated with buffer C. The column temperature
was 85°C and the detector operated at 254 nm to identify fractions with RNA and at 340
nm to identify fractions with 2MAf. The oligos were eluted with a convex gradient from
30% to 70% buffer D for the first 20 minutes at a flow rate of 0.8 ml/min, followed
by a linear gradient from 70% to 98% buffer D for another 20 min at the same flow rate. The
elution was completed by washing with 100% D for 7 min and 100% C for another 10 min
at a 1.0 ml/min flow rate. The fractions were detected with 254 or 260 nm wavelength light.
The ligated oligos (2MA-fragment 3) were eluted after 34 min, by more than 90% buffer D.
Fractions with 254 nm absorbance (A254nm > 0.01) were collected and dried with speed
vacuum overnight.
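
The position on the linear segment of this gradient (70% to 98% buffer D between 20 and 40 minutes) can be interpolated to estimate the buffer D percentage, and hence the NaClO4 concentration, at a given retention time. This is a sketch under the assumption of an ideal linear ramp with no system delay volume; the function names are hypothetical.

```python
def percent_d_linear(t_min, seg_start_min=20.0, seg_length_min=20.0,
                     start_pct=70.0, end_pct=98.0):
    """Buffer D percentage at time t on the linear segment of the gradient."""
    if not seg_start_min <= t_min <= seg_start_min + seg_length_min:
        raise ValueError("time is outside the linear segment")
    fraction = (t_min - seg_start_min) / seg_length_min
    return start_pct + fraction * (end_pct - start_pct)


def naclo4_mm(percent_d, stock_mm=250.0):
    """NaClO4 concentration (mM) for a given percentage of buffer D (250 mM stock)."""
    return percent_d / 100.0 * stock_mm


pct_at_34 = percent_d_linear(34.0)
print(f"34 min: about {pct_at_34:.1f}% D, about {naclo4_mm(pct_at_34):.0f} mM NaClO4")
```

Under these assumptions the gradient reaches about 90% buffer D at the 34-minute mark, in line with the elution figure quoted above.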

Step 7: Purified 2MA-fragment 3 desalting and precipitation:

The dried fractions were redissolved with 100 µl RNase-free distilled water. 500 µl of
cool 100% ethanol with 0.5 M (NH4)2CO3 was added and the mixture was vortexed
briefly. The mixture was then frozen on dry ice for 60 minutes or stored at -20°C overnight.

The samples were brought to 4°C and centrifuged at maximum speed in a
microcentrifuge for 15 minutes. The position of the pellet was noted and the supernatant
decanted or removed by pipette. Care was taken not to disturb the pellet. If the pellet still
contained salt, this step was repeated. The pellet was then washed with 70% pre-cooled
ethanol twice. The wet pellet was dried with speed vacuum for 15 min.

Urea PAGE was performed to identify the ligated 2MA-fragment-3 for use in the next step
of ligating fragment 1 to the 2MA-fragment-3 oligo, which completes the SATA linker.

Step 8: Preparation of SATA

A. RNA Oligo 5' phosphorylation

1. Reagents and instruments:

• Nuclease-Free Water (Cat# P1193 Promega)
• RNasin ® Ribonuclease Inhibitor (Cat# N2511 Promega)
• Phenol:chloroform
• Sterile microcentrifuge tubes
• 100% ethanol
• 70% ethanol
• Microcentrifuge at room temperature and +4°C
• PCR thermocycler or water bath

2. Assemble the following reaction mixture in a sterile microcentrifuge tube:

Component                                      Volume
Acceptor RNA                                   <200 ng
T4 ligase 10X Reaction Buffer*                 4 µl
RNasin ® Ribonuclease Inhibitor (40 u/µl)      20 units
T4 kinase (9-12 u/µl)                          2 µl
10 mM ATP                                      4 µl
Nuclease-Free Water to final volume            40 µl

Incubate at 37°C for 30 minutes in a PCR thermocycler or water bath. For
non-radioactive phosphorylation, use up to 300 pmol of 5' termini in a 30 to 40 µl reaction
containing 1X T4 Polynucleotide Kinase Reaction Buffer, 1 mM ATP and 10 to 20 units of
T4 Polynucleotide Kinase. Incubate at 37°C for 30 minutes. (1X T4 DNA Ligase Reaction
Buffer contains 1 mM ATP and can be substituted in non-radioactive phosphorylations. T4
Polynucleotide Kinase exhibits 100% activity in this buffer.) Fresh buffer is required for
optimal activity (in older buffers, loss of DTT due to oxidation lowers activity).

B. Annealing Fragment 1 and 2MA-fragment 3 oligo complex:

1. Reagents and instruments:

• PCR thermocycler instrument or water bath
• 100 µg/ml nuclease-free albumin
• 100 mM MgCl2

2. Assemble the following reaction mixture in a sterile microcentrifuge tube:

• Acceptor RNA oligo (IE) <200 ng
• Donor RNA oligo (3G-2G ligated oligo) <200 ng
• (5' phosphorylated oligo from step A)

Appropriate ratios are as follows: the Acceptor oligo:Donor oligo (Fragment 1 :
2MA-Fragment 3) molar ratio should be 1:1.1 to avoid Fragment 1 self-ligation. MgCl2 was
added to T4 ligase buffer (50 mM Tris-HCl (pH 7.8), 10 mM MgCl2, 10 mM DTT and
1 mM ATP) to a final concentration of 20 mM. Add RNase-free albumin to a final
concentration of 5 µg/ml. The final volume should be no more than 100 µl. The solution
was heated to 70°C for 5 min, then was cooled from 70°C to 26°C over 2 hours and cooled
from 26°C to 0°C over 40 minutes. Incubate at 16°C for 16 to 17 hours using a PCR instrument.
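
When programming a thermocycler for the two cooling stages described above, it can help to express them as ramp rates. The sketch below simply divides each temperature drop by its duration; `cooling_rate_c_per_min` is a hypothetical helper name, and whether a given instrument accepts rates this slow directly is not addressed here.

```python
def cooling_rate_c_per_min(start_c, end_c, duration_min):
    """Average cooling rate in °C per minute for a linear ramp."""
    if duration_min <= 0:
        raise ValueError("duration must be positive")
    return (start_c - end_c) / duration_min


# Stage durations and temperatures taken from the text.
stage_1 = cooling_rate_c_per_min(70.0, 26.0, 120.0)  # 70°C -> 26°C over 2 hours
stage_2 = cooling_rate_c_per_min(26.0, 0.0, 40.0)    # 26°C -> 0°C over 40 minutes

print(f"Stage 1: {stage_1:.2f} °C/min, Stage 2: {stage_2:.2f} °C/min")
```

If the instrument cannot ramp this slowly, the same profile might be approximated with a series of short holds at stepwise decreasing temperatures.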

C. Ligation of annealed oligos:

• Annealed oligos <15 µl
• 10 mM ATP 2 µl
• 40% PEG 18 µl
• T4 ligase 10X Buffer 2 µl
• RNasin ® Ribonuclease Inhibitor (40 u/µl) 0.5 µl
• T4 ligase (9-12 u/µl) (NEB) 1 µl
• Nuclease-Free Water to final volume 40 µl

D. Precipitating tRNA fragment:

After ligation, 50 µl DEPC water and 150 µl phenol:chloroform were added and
vortexed vigorously for 30 seconds. This was then centrifuged at maximum speed in a
microcentrifuge for 5 minutes at room temperature. The aqueous phase was transferred to a
new microcentrifuge tube (~100 µl). To this was added 2 µl of 10 mg/ml mussel glycogen and
10 µl of 3 M sodium acetate, pH 5.2. This was mixed well. Then 220 µl of 95% ethanol was
added and vortexed briefly. The mixture was then frozen on dry ice for 30 minutes. At this
point the mixture may be stored overnight at -20°C or one may proceed. The RNA must
not be stored in DEPC water, but in ethanol, at -20°C.

Then the samples were brought to 4°C and centrifuged at maximum speed in a
microcentrifuge for 15 minutes. The position of the pellet was noted and the supernatant
decanted or removed by pipette. Care was taken not to disturb the pellet. The pellet was then
washed with 70% pre-cooled ethanol twice. After removing the ethanol, the wet pellet was
dried with a speed vacuum for 15 min. The dried pellet was stored at -20°C until the next
step.

RNA Translation 

A luciferase mRNA which was modified to have the stop codon corresponding to
that recognized by the anticodon of the SATA (in the present case UAG) was used in a
standard Promega in vitro translation kit at the recommended 1 µl at a concentration of 1 µg/µl.
One skilled in the art will understand that indeed any mRNA fragment containing the
appropriate sequences may be used.

SATA was added to the experimental tubes. Control tubes which did not contain
SATA were also prepared. The quantity of SATA used was approximately between 0.1 µg
and 500 µg, preferably between 0.5 µg and 50 µg. 1 µl of Rnasin at 40 units/ml was added.
Nuclease-free water was added to make a total volume of 50 µl.

For proteins greater than approximately 150 amino acids, the amount of tRNA
may need to be supplemented. For example, approximately 10 - 200 µg of tRNA may be
added. In general, the quantity of the SATA should be high enough to effectively suppress
stop or pseudo stop codons. The quantity of the native tRNA must be high enough to
outcompete the SATA, which does not undergo dynamic proofreading under the action of
elongation factors.

Each tube was immediately capped, parafilmed and incubated for the translation
reactions at 30°C for 90 minutes. The contents of each reaction tube were transferred into a
50 µl quartz capillary tube by capillary action. The SATA was crosslinked with mRNA by
illuminating the contents of each tube with 2-10 J/cm2 of ~350 nm wavelength light, as per
Gasparro et al. (Photochem. Photobiol. 57:1007 (1993), herein incorporated by reference).
Following photocrosslinking, the contents of each tube were transferred into a new snap-cap
microfuge tube. The ribosomes were dissociated by chelating the calcium cations by
adding 2 µl of 10 mM EDTA to each tube. Between each step, each tube was gently mixed
by stirring each component with a pipette tip upon addition.

The optimal RNA concentration for translation was determined prior to performing
definitive experiments. Serial dilutions may be required to find the optimal concentration
of mRNA between 5-20 µg/ml.

SDS-PAGE electrophoresis was performed on each sample, as described above.
Autoradiography on the gel was performed, as described by Sambrook et al., Molecular
Cloning, A Laboratory Manual, 2nd ed., Cold Spring Harbor Press (1989), herein
incorporated by reference.

EXAMPLE 3: ALTERNATIVE SEQUENCES 

In a preferred embodiment, Fragments 1, 2 and 3, described above in Example 1, 
have the following alternate sequences: 

Fragment 1 (SEQ ID NO: 13):
5' PO4 GCGGAUUUAGCUCAGUUGGGAGAGCGCCAGA N3-Methyl-U 3'

Fragment 2 (SEQ ID NO: 14):
5' UCUAAGTCTOGAGG 3'

Fragment 3
(Unchanged from the sequence listed above (SEQ ID NO: 6)):
5' PO4 UCCUGUGPFCGAUCCACAGAAUUCGCACC Puromycin 3'

Using the methods described above, the sequence of alternative Fragments 1+2+3
was (SEQ ID NO: 15):

5' PO4 GCGGAUUUAGCUCAGUUGGGAGAGCGCCAGA N3-Methyl

UUCUPsoralenAAGTC^GGAGGUCCUGUGTYCGAUCCACAGAAUUCG 
Puromycin 3' 

While a number of preferred embodiments of the current invention and variations 
thereof have been described in detail, other modifications and methods of use will be 
readily apparent to those of skill in the art. Accordingly, it should be understood that 
various applications, modifications and substitutions may be made without departing from 
the spirit of the invention or the scope of the claims.

WHAT IS CLAIMED IS:

1. A method for selecting a desired protein or nucleic acid molecule comprising:

providing at least two candidate mRNA molecules, wherein at least one of
said mRNA molecules contains at least one codon selected from the group
consisting of a stop codon and a pseudo stop codon;

translating at least two of said candidate mRNA molecules to generate at
least one translated protein;

linking at least one of said candidate mRNA molecules to its corresponding
translated protein to form at least one cognate pair;

selecting one or more of said cognate pairs based upon the properties of said
translated protein or said mRNA molecule; and

selecting said translated protein or said desired nucleic acid molecule
comprising identifying a molecule selected from the group consisting of an mRNA
molecule of said selected cognate pair, a nucleic acid molecule complementary to
said mRNA molecule and a nucleic acid molecule homologous to said mRNA
molecule.

2. The method of Claim 1, wherein at least one of said candidate mRNA molecules
and at least one said translated protein is linked by a tRNA molecule selected from the
group consisting of tRNA, modified tRNA and tRNA analogs.

3. The method of Claim 2, wherein said tRNA molecule is connected to said
translated protein by ribosomal peptidyl transferase.

4. The method of Claim 2, wherein said tRNA molecule is connected to the
mRNA through an ultraviolet-induced cross link between the anticodon of said tRNA
molecule and the codon of the RNA message.

5. The method of Claim 2, wherein said tRNA molecule is connected to the
mRNA by thiouracil.

6. The method of Claim 2, wherein said tRNA molecule is connected to the
mRNA by a psoralen cross link.



7. The method of Claim 2, wherein said tRNA molecule is connected to the 
mRNA by using a psoralen monoadduct. 

8. The method of Claim 7, wherein said psoralen monoadduct is placed on said 
mRNA. 

9. The method of Claim 7, wherein said psoralen monoadduct is placed on an 
anticodon of said tRNA molecule. 

10. The method of Claim 7, wherein said psoralen monoadduct is placed on a stop 
anticodon of said tRNA molecule. 

11. The method of Claim 7, wherein said psoralen monoadduct is generated by 
forming crosslinks and photo reversing said crosslinks or by using visible light. 

12. The method of Claim 7, wherein said psoralen monoadduct is generated using 
light of a wavelength in the range of about 300 to about 450 nm. 

13. The method of Claim 12, wherein said wavelength is 365 nm. 

14. The method of Claim 2, wherein said tRNA molecule has a stable peptide 
acceptor. 

15. The method of Claim 2, wherein said tRNA molecule is modified to accept a 
peptide chain and hold said chain in a stable manner, such that peptidyl transferase cannot 
detach it. 

16. The method of Claim 2, wherein said tRNA molecule is modified by using a
bond which binds to the ribosome, accepts the peptide chain, and then does not act as a
donor in the next transpeptidation.

17. The method of Claim 16, wherein said bond is selected from the group
consisting of a 2' ester on a 3' deoxy adenosine and an amino acyl tRNA^^.

18. The method of Claim 2, wherein an amino acid or amino acid analog is attached 
to the 3' end of said tRNA molecule by a stable bond to generate a stable aminoacyl tRNA 
analog. 

19. The method of Claim 1, wherein said translating is performed in vitro. 

20. The method of Claim 1, wherein said translating is performed in situ. 

21. The method of Claim 1, wherein said cognate pair is selected based upon ligand 
binding. 

22. The method of Claim 1, wherein selecting said desired nucleic acid molecule
comprises:

having an array of nucleic acids, wherein said nucleic acids are placed in a
predetermined position;

hybridizing at least one of said cognate pairs onto said array; and

identifying the mRNA based upon its reaction to a protein identifier.

23. The method of Claim 1, wherein said reaction to a protein identifier is selected 
from the group consisting of ligand binding, immunoprecipitation and enzymatic reactions. 

24. A kit to generate cognate pairs comprising at least one psoralen monoadduct 
attached to a nonadducted stable aminoacyl tRNA analog or at least one psoralen 
monoadduct attached to an oligonucleotide. 

25. A method for evolving a desired protein sequence comprising: 

providing at least two candidate mRNA molecules, wherein at least one of 
said mRNA molecules contains at least one codon selected from the group 
consisting of a stop codon and a pseudo stop codon; 

translating at least two of said candidate mRNA molecules to generate at
least one translated first protein;

linking at least one of said candidate mRNA molecules to its corresponding
translated first protein to form at least one first cognate pair;

selecting at least one of said first cognate pairs based upon at least one
desired characteristic of said translated first protein or said mRNA;

recovering at least one of said first cognate pairs with said desired
characteristic to generate at least one recovered cognate pair;

amplifying a first nucleic acid component of one or more of said recovered
cognate pairs;

producing at least one second nucleic acid component comprising at least
one of said first nucleic acid components with one or more variations;

producing at least one second protein by translating at least one of said
second nucleic acid components;

linking at least one of said second proteins with at least one of said second
nucleic acid components to generate one or more second cognate pairs; and

obtaining the desired protein sequence by re-selecting one or more of said
second cognate pairs based upon at least one desired property, wherein said desired
property is the same or different than said desired characteristic.

26. The method of Claim 25, wherein said desired characteristic is selected from the
group consisting of binding properties, enzymatic reactions and chemical modifications.

27. The method of Claim 25, wherein selecting at least one of said first cognate
pairs comprises:

providing a first ligand with a desired binding characteristic;

contacting one or more of said first cognate pairs with said first ligand to
generate unbound complexes and bound complexes;

recovering either the bound complexes or the unbound complexes;

amplifying at least one nucleic acid component of the recovered complexes;

introducing variation to a sequence of said nucleic acid component of said
recovered complexes;

translating one or more second proteins from said nucleic acid components;

linking at least one of said second proteins with at least one of said second
nucleic acid components to generate one or more second cognate pairs; and

obtaining the desired protein sequence by contacting said at least one of said
second cognate pairs with at least one second ligand to select one or more of said
second cognate pairs, wherein said second ligand is the same or different than said
first ligand.

28. A method of forming a monoadduct comprising:

providing a target oligonucleotide comprising at least one uridine and at
least one modified uridine,

contacting said target oligonucleotide with psoralen, and

coupling said psoralen to said target oligonucleotide to form a monoadduct.

29. A method according to Claim 28, wherein said modified uridine is modified to 
avoid coupling with psoralen. 

30. A method according to Claim 28, wherein said modified uridine is
pseudouridine.

31. A method according to Claim 28, wherein said target oligonucleotide is a tRNA 
molecule selected from the group consisting of tRNA, modified tRNA and tRNA analogs. 


32. A method according to Claim 28, wherein said target oligonucleotide is a 
mRNA molecule selected from the group consisting of mRNA, modified mRNA and 
mRNA analogs. 

33. A method according to Claim 28, wherein psoralen is coupled to said target
oligonucleotide by one or more cross-links.

34. A method according to Claim 28, wherein the method further comprises
providing a second oligonucleotide comprising a nucleotide sequence complementary to the
target oligonucleotide sequence, wherein the second oligonucleotide contains no uridine.

35. A method according to Claim 28, wherein the method further comprises
providing a second oligonucleotide comprising a nucleotide sequence complementary to the
target oligonucleotide sequence, wherein said second oligonucleotide comprises uridine
residues and wherein the uridine residues are modified to avoid cross-linking with the
target oligonucleotide.

36. A method according to Claim 35, wherein the modified uridine is
pseudouridine.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5